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The invention relates generally to polynucleotides and polypeptides, as well as vectors, 
host cells, antibodies, and recombinant methods for producing these nucleic acids and 
polypeptides. 



The invention is based in part upon the discovery of novel nucleic acid sequences 
encoding novel polypeptides. The disclosed FCTRl, FCTR2, FCTR3, FCTR4, FCTR5, FCTR6 
and FCTR7 nucleic acids and polypeptides encoded therefrom, as well as derivatives, homologs, 
analogs and fragments thereof, will hereinafter be collectively designated as "FCTRX" nucleic 
acid or polypeptide sequences. 

In one aspect, the invention provides an isolated FCTRX nucleic acid molecule encoding 
a FCTRX polypeptide that includes a nucleic acid sequence that has identity to the nucleic acids 
disclosed in SEQ IDNOS:l,3,5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24. In some 
embodiments, the FCTRX nucleic acid molecule will hybridize under stringent conditions to a 
nucleic acid sequence complementary to a nucleic acid molecule that includes a protein-coding 
sequence of a FCTRX nucleic acid sequence. The invention also includes an isolated nucleic 
acid that encodes a FCTRX polypeptide, or a fragment, homolog, analog or derivative thereof 
For example, the nucleic acid can encode a polypeptide at least 80% identical to a polypeptide 
comprising the amino acid sequences of SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25. 
The nucleic acid can be, for example, a genomic DNA fragment or a cDNA molecule that 
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includes the nucleic acid^uence of any of SEQ ID NOS: 1, 3, 5^9, 10, 1 1, 12, 14, 16, 18, 
20, 22, and 24. W W 

Also included in the invention is an oligonucleotide, e.g., an oligonucleotide which 
includes at least 6 contiguous nucleotides of a FCTRX nucleic acid {e.g., SEQ ID NOS: 1, 3, 5, 
7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24) or a complement of said oHgonucleotide. 

Also included in the invention are substantially purified FCTRX polypeptides (SEQ ID 
NO: 2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25). In certain embodiments, the FCTRX polypeptides 
include an amino acid sequence that is substantially identical to the amino acid sequence of a 
human FCTRX polypeptide. 

The invention also features antibodies that immunoselectively-binds to FCTRX 
polypeptides, or fragments, homologs, analogs or derivatives thereof. 

In another aspect, the invention includes pharmaceutical compositions that include 
therapeutically- or prophylactically-effective amounts of a therapeutic and a pharmaceutically- 
acceptable carrier. The therapeutic can be, e.g., a FCTRX nucleic acid, a FCTRX polypeptide, 
or an antibody specific for a FCTRX polypeptide. In a further aspect, the invention includes, in 
one or more containers, a therapeutically- or prophylactically-effective amount of this 
pharmaceutical composition. 

In a further aspect, the invention includes a method of producing a polypeptide by 
culturing a cell that includes a FCTRX nucleic acid, under conditions allowing for expression of 
the FCTRX polypeptide encoded by the DNA. If desired, the FCTRX polypeptide can then be 
recovered. 

In another aspect, the invention includes a method of detecting the presence of a FCTRX 
polypeptide in a sample. In the method, a sample is contacted v^th a compound that selectively 
binds to the polypeptide under conditions allowing for formation of a complex between the 
polypeptide and the compound. The complex is detected, if present, thereby identifying the 
FCTRX polypeptide within the sample. 

The invention also includes methods to identify specific cell or tissue types based on their 
expression of a FCTRX. 

Also included in the invention is a method of detecting the presence of a FCTRX nucleic 
acid molecule in a sample by contacting the sample with a FCTRX nucleic acid probe or primer, 
and detecting whether the nucleic acid probe or primer bound to a FCTRX nucleic acid molecule 
in the sample. 

In a further aspect, the invention provides a method for modulating the activity of a 
FCTRX polypeptide by contacting a cell sample that includes the FCTRX polypeptide with a 
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compound that binds to tJ^FCTRX polypeptide in an amount su^^nt to modulate the activity 
of said polypeptide. Th^Bmpound can be, e.g., a small molecul^Bich as a nucleic acid, 
peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon containing) or 
inorganic molecule, as further described herein. 

Also within the scope of the invention is the use of a Therapeutic in the manufacture of a 
medicament for treating or preventing disorders or syndromes including, e.g.. Colorectal cancer, 
adenomatous polyposis coli, myelogenous leukemia, congenital ceonatal alloinmiune 
thrombocytopenia, multiple human solid malignancies, malignant ovarian tumours particularly at 
the interface between epithelia and stroma, malignant brain tiunors, mammary tumors, human 
gliomas, astrocytomas, mixed glioma/astrocytomas, renal cells carcinoma, breast 
adenocarcinoma, ovarian cancer, melanomas, renal cell carcinoma , clear cell and granular cell 
carcinomas, autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine 
stimulation of tumor cell survival and tumor cell resistance to c5^otoxic therapy, paranechmal 
and basement membrane invasion and motility of tumor cells thereby contributing to metastasis, 
tumor-mediated immunosuppression of T-cell mediated immune effector cells and pathways 
resulting in txmior escape from immune surveilance, neurological disorders, neurodegenerative 
disorders, nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, 
demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions, 
immunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type 
eosinophilia, lung diseases, reproductive disorders, male infertility, female reproductive system 
disorders, male and female reproductive diseases, hemangioma, deafness, glycoprotein la 
deficiency, desmoid disease, turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders, 
pancreatic diseases like diabetes, Schistosoma mansoni infection. Spinocerebellar ataxia, 
Plasmodium falciparum parasitemia, Corneal dystrophy - Groenouw type I, Corneal dystrophy - 
lattice type I, and Reis-Bucklers corneal dystrophy. The Therapeutic can be, e.g., a FCTRX 
nucleic acid, a FCTRX polypeptide, or a FCTRX-specific antibody, or biologically-active 
derivatives or fragments thereof. 

The invention fiirther includes a method for screening for a modulator of disorders or 
syndromes including, e.g.. Also within the scope of the invention is the use of a Therapeutic in 
the manufacture of a medicament for treating or preventing disorders or syndromes including, 
e.g., Colorectal cancer, adenomatous polyposis coli, myelogenous leukemia, congenital ceonatal 
alloimmune thrombocytopenia, multiple human solid malignancies, malignant ovarian tumours 
particularly at the interface between epithelia and stroma, malignant brain tumors, mammary 
tumors, human gliomas, astrocytomas, mixed glioma/astrocytomas, renal cells carcinoma, breast 
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adenocarcinoma, ovariai 
carcinomas, autocrine/p; 




stimulation of tumor cell survival and tumor cell resistance to cytotoxic therapy, paranechmal 
and basement membrane invasion and motility of tumor cells thereby contributing to metastasis, 
5 tumor-mediated immxmosuppression of T-cell mediated immune effector cells and pathways 
resulting in tumor escape from immune surveilance, neurological disorders, neurodegenerative 
disorders, nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, 
demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions, 
inmiunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type 

10 eosinophilia, lung diseases, reproductive disorders, male infertility, female reproductive system 
disorders, male and female reproductive diseases, hemangioma, deafiiess, glycoprotein la 
deficiency, desmoid disease, turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders, 
pancreatic diseases like diabetes. Schistosoma mansoni infection. Spinocerebellar ataxia, 
Plasmodium falciparum parasitemia, Corneal dystrophy - Groenouw type I, Corneal dystrophy - 

15 lattice type I, and Reis-Bucklers corneal dystrophy. The method includes contacting a test 
compound with a FCTRX polypeptide and determining if the test compound binds to said 
FCTRX polypeptide. Binding of the test compoimd to the FCTRX polypeptide indicates the test 
compound is a modulator of activity, or of latency or predisposition to the aforementioned 
disorders or syndromes. 

20 Also within the scope of the invention is a method for screening for a modulator of 

activity, or of latency or predisposition to an disorders or syndromes including, e.g.. Also within 
the scope of the invention is the use of a Therapeutic in the manufacture of a medicament for 
treating or preventing disorders or syndromes including, e.g.. Colorectal cancer, adenomatous 
polyposis coli, myelogenous leukemia, congenital ceonatal alloimmune thrombocytopenia, 

25 multiple human solid malignancies, malignant ovarian tumours particularly at the interface 
between epithelia and stroma, malignant brain tumors, mammary tumors, human gliomas, 
astrocytomas, mixed glioma/astrocytomas, renal cells carcinoma, breast adenocarcinoma, 
ovarian cancer, melanomas, renal cell carcinoma , clear cell and granular cell carcinomas, 
autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine stimulation of 

30 tumor cell survival and tumor cell resistance to cytotoxic therapy, paranechmal and basement 
membrane invasion and motility of tumor cells thereby contributing to metastasis, tumor- 
mediated immunosuppression of T-cell mediated immune effector cells and pathways resulting 
in tumor escape from immune surveilance, neurological disorders, neurodegenerative disorders, 
nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy. 
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eosinophilia, lung diseases, reproductive disorders, male infertility, female reproductive system 
disorders, male and female reproductive diseases, hemangioma, deafiiess, glycoprotein la 
5 deficiency, desmoid disease, turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders, 
pancreatic diseases like diabetes. Schistosoma mansoni infection. Spinocerebellar ataxia, 
Plasmodium falciparum parasitemia. Corneal dystrophy - Groenouw type I, Corneal dystrophy - 
lattice type I, and Reis-Bucklers comeal dystrophy by administering a test compound to a test 
animal at increased risk for the aforementioned disorders or syndromes. The test animal 

10 expresses a recombinant polypeptide encoded by a FCTRX nucleic acid. Expression or activity 
of FCTRX polypeptide is then measured in the test animal, as is expression or activity of the 
protein in a control animal which recombinantly-expresses FCTRX polypeptide and is not at 
increased risk for the disorder or syndrome. Next, the expression of FCTRX polypeptide in both 
the test animal and the control animal is compared. A change in the activity of FCTRX 

15 polypeptide in the test animal relative to the control animal indicates the test compound is a 
modulator of latency of the disorder or syndrome. 

In yet another aspect, the invention includes a method for determining the presence of or 
predisposition to a disease associated with altered levels of a FCTRX polypeptide, a FCTRX 
nucleic acid, or both, in a subject {e.g. , a human subject). The method includes measuring the 

20 amount of the FCTRX polypeptide in a test sample from the subject and comparing the amount 
of the polypeptide in the test sample to the amount of the FCTRX polypeptide present in a 
control sample. An alteration in the level of the FCTRX polypeptide in the test sample as 
compared to the control sample indicates the presence of or predisposition to a disease in the 
subject. Preferably, the predisposition includes, e.g.. Also within the scope of the invention is 

25 the use of a Therapeutic in the manufacture of a medicament for treating or preventing disorders 
or syndromes including, e.g.. Colorectal cancer, adenomatous polyposis coli, myelogenous 
leukemia, congenital ceonatal alloimmune thrombocytopenia, multiple human solid 
malignancies, malignant ovarian tumours particularly at the interface between epithelia and 
stroma, malignant brain tumors, manmiary tumors, human gliomas, astrocytomas, mixed 

30 glioma/astrocytomas, renal cells carcinoma, breast adenocarcinoma, ovarian cancer, melanomas, 
renal cell carcinoma , clear cell and granular cell carcinomas, autocrine/paracrine stimulation of 
tumor cell proliferation, autocrine/paracrine stimulation of tumor cell survival and tumor cell 
resistance to cytotoxic therapy, paranechmal and basement membrane invasion and motility of 
tumor cells thereby contributing to metastasis, tumor-mediated immunosuppression of T-cell 
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►e from immune 



surveilance, neurologicsiMR)rders, neurodegenerative disorders,^i^e trauma, familial 
myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, demyelinating Gardner syndrome, 
familial myelodysplastic syndrome; mental health conditions, immunological disorders, allergy 
and infection, asthma, bronchial asthma, Avellino type eosinophilia, lung diseases, reproductive 
disorders, male infertility, female reproductive system disorders, male and female reproductive 
diseases, hemangioma, deafriess, glycoprotein la deficiency, desmoid disease, turcot syndrome, 
liver cirrhosis, hepatitis C, gastric disorders, pancreatic diseases like diabetes. Schistosoma 
mansoni infection, Spinocerebellar ataxia, Plasmodium falciparum parasitemia, Comeal 
dystrophy - Groenouw type I, Comeal dystrophy - lattice type I, and Reis-Bucklers comeal 
dystrophy. Also, the expression levels of the new polypeptides of the invention can be used in a 
method to screen for various cancers as well as to determine the stage of cancers. 

In a further aspect, the invention includes a method of treating or preventing a 
pathological condition associated with a disorder in a mammal by administering to the subject a 
FCTRX polypeptide, a FCTRX nucleic acid, or a FCTRX-specific antibody to a subject (e.g., a 
human subject), in an amount sufficient to alleviate or prevent the pathological condition. In 
preferred embodiments, the disorder, includes, e.g.. Also within the scope of the invention is the 
use of a Therapeutic in the manufacture of a medicament for treating or preventing disorders or 
syndromes including, e.g.. Colorectal cancer, adenomatous polyposis coli, myelogenous 
leukemia, congenital ceonatal alloinmiune thrombocytopenia, multiple human solid 
malignancies, malignant ovarian tumours particularly at the interface between epithelia and 
stroma, malignant brain tumors, mammary tumors, human gliomas, astrocytomas, mixed 
glioma/astrocytomas, renal cells carcinoma, breast adenocarcinoma, ovarian cancer, melanomas, 
renal cell carcinoma , clear cell and granular cell carcinomas, autocrine/paracrine stimulation of 
tumor cell proliferation, autocrine/paracrine stimulation of tumor cell survival and tumor cell 
resistance to cytotoxic therapy, paranechmal and basement membrane invasion and motility of 
tumor cells thereby contributing to metastasis, tumor-mediated inrununosuppression of T-cell 
mediated immune effector cells and pathways resulting in tumor escape from immune 
surveilance, neurological disorders, neurodegenerative disorders, nerve trauma, familial 
myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, demyelinating Gardner syndrome, 
familial myelodysplastic syndrome; mental health conditions, immunological disorders, allergy 
and infection, asthma, bronchial asthma, Avellino type eosinophilia, Ivmg diseases, reproductive 
disorders, male infertility, female reproductive system disorders, male and female reproductive 
diseases, hemangioma, deafiiess, glycoprotein la deficiency, desmoid disease, turcot syndrome, 
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liver cirrhosis, hepatitis ^^astric disorders, pancreatic diseases li^^iabetes. Schistosoma 
mansoni infection, Spino^lbellar ataxia, Plasmodium falcipammlmrasitemia. Corneal 
dystrophy - Groenouw type I, Corneal dystrophy - lattice type I, and Reis-Bucklers corneal 
dystrophy. 

5 In yet another aspect, the invention can be used in a method to identity the cellular 

receptors and downstream effectors of the invention by any one of a number of techniques 
commonly employed in the art. These include but are not limited to the two-hybrid system, 
aflFinity purification, co-precipitation with antibodies or other specific-interacting molecules. 
Unless otherwise defined, all technical and scientific terms used herein have the same 
10 meaning as commonly imderstood by one of ordinary skill in the art to which this invention 

belongs. Although methods and materials similar or equivalent to those described herein can be 
used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In the case of conflict, the present 
1 5 specification, including definitions, will control. In addition, the materials, methods, and 
examples are illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent fi'om the following 
detailed description and claims. 

DETAILED DESCRIPTION 

20 The invention is based, in part, upon the discovery of novel nucleic acid sequences that 

encode novel polypeptides. The novel nucleic acids and their encoded polypeptides are referred 
to individually as FCTRl, FCTR2, FCTR3, FCTR4, FCTR5, FCTR6, and FCTR7. The nucleic 
acids, and their encoded polypeptides, are collectively designated herein as "FCTRX". 
The novel FCTRX nucleic acids of the invention include the nucleic acids whose 

25 sequences are provided in Tables 1 A, 2 A, 3 A, 3C, 3E, 3F, 3G, 3H, 4 A, 5 A, 5C, 5E, 6A, 6C, and 
7A inclusive ("Tables 1 A - 7A"), or a fi-agment, derivative, analog or homolog thereof The 
novel FCTRX proteins of the invention include the protein fragments whose sequences are 
provided in Tables IB, 2B, 3B, 31, 4B, 5B, 5D, 6B, 6D, and 7B inclusive ("Tables IB - 7B"). 
The individual FCTRX nucleic acids and proteins are described below. Within the scope of this 

30 invention is a method of using these nucleic acids and peptides in the treatment or prevention of 
a disorder related to cell signaling or metabolic pathway modulation. 
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FCTRl 





Novel FCTRl is^owth factor ("FCTR") protein relatedfofollistatin-like gene, and 
mac25. FCTRl (also referred to by proprietary accession number 58092213.0.36) is a foU-length 
clone of 771 nucleotides, including the entire coding sequence of a 105 amino acid protein from 
5 nucleotides 438 to 753. The clone was originally obtained from thyroid gland, kidney, fetal 
kidney, and spleen tissues. 

The nucleotide sequence of FCTRl as presently determined is reported in Table 1 A. The 
start and stop codons are bolded and the 5' and 3' untranslated regions are underlined. 



10 GGTCCTCACCCCCTTCCTCTCTCCCAGCCTCGGTGTCTGGTTACGGCTCCTCTGCTCGCATTGTGACTTTGGGCCAGGCTGGGG^ 
AATGACCCGGGAGGGTCCCATGCGGCTACATAAAATTGGCAGCCTTAGAACTAGTGGGAAGGCGGGTGCGCGAAGTCGAGGGG 
AGAGAGGGGGCCGGAGGAGCTGCTTTCTGAATCCAAGTTCGTGGGCTCTCTCAGAAGTCCTCAGGACGGAGCAGAGGTGGCCGGCG 
GGCCCGGCTGACTGCGCCTCTGCTTTCTTTCCATAACCTTTTCTTTCGGACTCGAATCACGGCTGCTGCGAAGGGTCTAGTTCCGG 
ACACTAGGGCCCCAGATCGTGTCACATCCATATGACACTTGGAATGTGACAGGGCAGGATGTGATCTTTGGCTGTGAAGT^ 

15 CTACCCCA TGGCCTCCATCGAGTGGAGGAAGGATGGCTTGGACATCCAGCTGCCAGGGGATGACCCCCACATCTCTGTGCAGTTTA 
GGGGTGGACCCCAGAGGTTTGAGGTGACTGGCTGGCTGCAGATCCAGGCTGTGCGTCCCAGTGATGAGGGCACTTACCGCTGCCTT 
GCCCGCAATGCCCTGGGTCAAGTGGAGGCCCCTGCTAGCTTGACAGTGCTCACACCTGACCAGCTGAACTCTACAGGCATC 
GCTGCGATCACTAAACCTGGTTCCTGAGGAGGAGGCTGAGAGTGAAGAGAATGACGATTACTACTA GGTCCAGAGCTCTGGCC 



20 The predicted amino acid sequence of FCTRl protein corresponding to the foregoing 

nucleotide sequence is reported in Table IB. FCTRl was searched against other databases using 
SignalPep and PSort search protocols. The protein is most likely located in the cytoplasm 
(certainty=0.6500) and seems to have no N-terminal signal sequence. The predicted molecular 
weight of FCTRl protein is 1 171 1 .8 daltons. 



MASIEWRKDGLDIQLPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVLTPDQLNSTGIPQLR 
SLNLVPEEEAESEENDDYY 

FCTRl was initially identified with a TblastN analysis of a proprietary sequence file for a 
30 foUistatin-like probe or homolog which was run against the Genomic Daily Files made available 
by GenBank. A proprietary software program (GenScan™) was used to fiirther predict the 
nucleic acid sequence and the selection of exons. The resulting sequences were further modified 
by means of similarities using BLAST searches. The sequences were then manually corrected 
for apparent inconsistencies, thereby obtaining the sequences encoding the fiiU-length protein. 
35 In an analysis of sequence databases, it was found, for example, that the FCTRl nucleic 

acid sequence has 31/71 bases (43%) identical and 46/71 bases positively alike to a Mus 
Musculus IGFBP-like protein (TREMBL Accession Number:BAA21725) shown in Table IC. 
In all BLAST alignments herein, the "E- value" or "Expect" value is a numeric indication of the 
probability that the aligned sequences could have achieved their similarity to the BLAST query 



Table lA. FCTRl nucleotide sequence (SEQ ID N0:1). 



25 



Table IB. Encoded FCTRl protein sequence (SEQ ID NO:2). 
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sequence by chance alon^^thin the database that was searched.^^ example, as shown in 
Table IC, the probabilitJBit the subject ("Sbjct") retrieved fronMrFCTRl BLAST analysis, 
in this case the Mus Musculus IGFBP-like protein, matched the Query FCTRl sequence purely 
by chance is L2xl0"'^ 

Table IC. BLASTP of FCTRl against Mus Musculus IGFBP-like protein (SEQ ID NO:38) 

PTNR:REMTREMBL-ACC:BAA21725 IGFBP-LIKE PROTEIN - MUS MUSCULUS (MOUSE), 270 AA. 

LENGTH = 270 

SCORE = 161 (56.7 BITS), EXPECT = 1.2E-11, P = 1.2E-11 
IDENTITIES = 31/71 (43%), POSITIVES = 46/71 (64%) 

QUERY: 9 DGLDIQLPGDDPHXSVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPAS 68 

^Ih HIM H + ll MM I I M I H Ml I I I Ikh ^ 

SBJCT: 191 EGLE-ELPGDHVNIAVQVRGGPSDHETTSWILINPLRKEDEGVYHCHAANAIGEAQSHGT 249 

QUERY: 6 9 LTVLTPDQLNS 79 

HI I I 
SBJCT: 250 VTVLDLNRYKS 260 

The amino acid sequence of FCTRl also had 26/58 bases (44%) identical, and 38/58 
bases (65%) positive for Mus Musculus Follistatin-like Protein shown in Table ID. 



Table ID. BLASTP of FCTRl against Mus Musculus FoUistatin-iike Protein (SEQ ID 

NO:39) 



PTNR:SPTREMBL-ACC:Q61581 FOLLISTATIN-LIKE 2 (FOLLISTATIN-LIKE PROTEIN) - MUS MCTSCULUS 
(MOUSE) , 238 AA. 

LENGTH = 238 

SCORE = 149 (52.5 BITS), EXPECT = 1.5E-10, P = 1.5E-10 
IDENTITIES = 26/58 (44%), POSITIVES = 38/58 (65%) 

QUERY: 15 LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL 72 

till lllh^ llllh ^ ^ I I I I I h II I I ^Ih 

SBJCT: 165 LPGDRENLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASAAAKITW 222 

The amino acid sequence of FCTRl also had 26/58 bases (44%) identical, and 38/58 
bases (65%) positive for Homo sapiens MAC25 protein shown in Table IE. 



Table IE. BLAST? of FCTRl against Homo sapiens MAC25 protein (SEQ ID NO:40) 



PTNR:SPTREMBL-ACC:Q07822 MAC25 PROTEIN - HOMO SAPIENS (HUMAN), 277 AA. 
LENGTH =277 

SCORE = 149 (52.5 BITS), EXPECT = 3.2E-10, P = 3.2E-10 
IDENTITIES = 26/58 (44%), POSITIVES - 38/58 (65%) 

QUERY: 15 LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL 72 

MM ++-^+1 MM^-^ MMk ^ I M I I 1+ M I I 

SBJCT: 209 LPGDRDNLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASASAKITW 266 
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^jl^nce of FCTRl also had 26/58 bases^^ 
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The amino acid HPfence of FCTRl also had 26/58 bases^F/o) identical, and 38/58 
bases (65%) positive for Mus musculus MAC25 protein shovm in Table IF. 

Table IF. BLASTP of FCTRl against Mus musculus MAC25 protein (SEQ ID NO:41) 

PTNR:SPTREMBL-ACC:088812 MAC25 - MUS MDSCULUS (MOUSE), 281 AA 

LENGTH = 2 81 

SCORE = 149 (52.5 BITS), EXPECT = 3.4E-10, P = 3.4E-10 
IDENTITIES = 26/58 (44%), POSITIVES = 38/58 (65%) 



QUERY: 15 LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL 72 

15 MM ^-^^H MM^+ MMk + + I I M I k M I I +M^ 

SBJCT: 208 LPGDRENLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASAAAKITW 265 

The amino acid sequence of FCTRl also had 26/58 bases (44%) identical, and 38/58 
bases (65%) positive for Homo sapiens Prostacyclin-stimulating factor shown in Table IG. 

20 Table IG. BLAST? of FCTRl against Homo sapiens Prostacyclin-stimulating factor (SEQ 
^ ID NO:42) 

!=f PTNR:SPTREMBL-ACC:Q16270 PROSTACYCLIN-STIMULATING FACTOR - HOMO SAPIENS (HUMAN), 282 

Vl AA 

D 25 LENGTH = 282 

\f% SCORE = 149 (52.5 BITS), EXPECT = 3.4E-10, P = 3.4E-10 

IDENTITIES = 26/58 (44%), POSITIVES = 38/58 (65%) 

s 30 QUERY: 15 LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL 72 

P MM MM+^ MMk + + I I I I I k M I I 

hi SBJCT: 209 LPGDRDNLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASASAKITW 266 

n 

5 The amino acid sequence of FCTRl also had 1 8/44 bases (40%) identical, and 25/44 

35 bases (56%) positive for rat Colorectal cancer suppressor shown in Table IH. 

Table IH. BLASTP of FCTRl against rat Colorectal cancer suppressor (SEQ ID NO:43) 

PTNR:PIR-rD:B40098 COLORECTAL CANCER SUPPRESSOR DCC - RAT (FRAGMENTS) 
40 LENGTH = 144 

SCORE = 78 (27.5 BITS), EXPECT = l.lE-05, SUM P(2) = l.lE-05 
IDENTITIES = 18/44 (40%) , POSITIVES = 25/44 (56%) 

45 QUERY: 3 3 FEVTGW- -LQIQAVRPSDEGTYRCLARNALGQVEAPASLTVLTP 74 

t M I MM khi I I I I I I 

SBJCT: 101 FQIVGGSNLRILGWKSDEGFYQCVAENEAGNAQSSAQLIVPKP 144 

SCORE = 37 (13.0 BITS), EXPECT = l.lE-05, SUM P(2) = l.lE-05 
50 IDENTITIES = 8/19 (42%), POSITIVES = 12/19 (63%) 

QUERY: 1 MASIEWRKDGLDIQL-PGD 18 

I H h Ml 

SBJCT: 30 MPTIHWQKNQQDLTPNPGD 4 8 

55 
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The amino acid s^ence of FCTRl also had 32/83 bases^^/o) identical, and 45/83 
bases (54%) positive to HPs 55-137, and 24/68 bases (35%) idelHl, and 37/68 bases (54%) 
positive to bases 166-225 of Homo sapiens PTPsigma-(Brain) Precursor shown in Table II. 

Table 11. BLASTP of FCTRl ikgainst Homo sapiens PTPsigma-(Brain) Precursor (SEQ ID 

NO:44) 

PTNR:TREMBLNEW-ACC:AAD09360 FTPS IGMA- (BRAIN) PRECURSOR - HOMO SAPIENS (HUMAN), 1502 
AA. 

LENGTH = 1502 

SCORE ^ 109 (38.4 BITS), EXPECT = 0.00010, P = 0.00010 
IDENTITIES = 32/83 (38%), POSITIVES = 45/83 (54%) 

QUERY: 14 QLPGDD- PHISVQFRG GPQRFEVTGW LQIQAVR- PSDEGTYRCLARNALG 61 

1111+-^ ^1 MM + hM +1 I M I khh^l 

SBJCT: 55 QATGDPKPRVTWNKKGKKVNSQRFETIEFDESAGAVLRIQPLRTPRDENVYECVAQNSVG 114 

QUERY: 62 QVEAPASLTVLTPDQLNSTGIPQL 85 

I MM Ml I I I + 
SBJCT: 115 EITVHAKLTVLREDQLPS-GFPNI 13 7 

SCORE = 77 (27.1 BITS), EXPECT =0.25, P = 0.22 
IDENTITIES = 24/68 (35%), POSITIVES = 37/68 (54%) 

QUERY: 4 lEWRKDGLDIQLPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALG-Q 62 

I I M I ^ Ml I +1 llh+ -hi I hi h I - 

SBJCT: 166 ITWFKDFLPV DPSAS NGRIKQLR-SGALQIESSEETDQGKYECVATNSAGVR 216 

QUERY: 63 VEAPASLTV 71 

-Ihl I 
SBJCT: 217 YSSPANLYV 225 

The amino acid sequence of FCTRl also had 32/83 bases (38%) identical, and 45/83 
bases (54%) positive for amino acids 55-137 and 26/69 bases (37%) identical, and 38/69 (54%) 
positive for amino acids 166-234 of Homo sapiens Protein-Tyrosine Phosphatase Sigma shown 
in Table IJ. 

Table IJ. BLASTP of FCTRl against Homo sapiens PTPsigma-(Brain) Precursor (SEQ ID 

NO:45) 

PTNR:SPTREMBL-ACC:Q13332 PROTEIN-TYROSINE PHOSPHATASE, RECEPTOR -TYPE, S PRECURSOR (EC 
3.1.3.48) (PROTEIN-TYROSINE PHOSPHATASE SIGMA) (R-PTP-SIGMA) (PTPRS) - HOMO SAPIENS 
(HUMAN) , 1948 AA. 

LENGTH = 1948 

SCORE = 109 (38.4 BITS), EXPECT = 0.00013, P = 0.00013 
IDENTITIES = 32/83 (38%), POSITIVES = 45/83 (54%) 

QUERY: 14 QLPGDD- PHISVQFRG- - -GPQRFEVTGW LQIQAVR -PSDEGTYRCLARNALG 61 

I II I H MM - hll -II II I hl-h-l 

SBJCT: 55 QATGDPKPRVTWNKKGKKVNSQRFETIEFDESAGAVLRIQPLRTPRDENVYECVAQNSVG 114 

QUERY: 62 QVEAPASLTVLTPDQLNSTGIPQL 85 

I MM III I II - 
SBJCT: 115 EITVHAKLTVLREDQLPS-GFPNI 137 

SCORE = 88 (31.0 BITS), EXPECT = 0.023, P = 0.022 
IDENTITIES = 26/69 (37%), POSITIVES = 38/69 (55%) 
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QUERY: 
SBJCT: 
QUERY: 
SBJCT : 



4 IEWRKDGLDIQLPGDDPHISVQFRGGPQRFEVT---GWLQIQAVRPSDEGTYRCLARNAL 60 

I I 11 IJ^ ^111 ^111 I 111^^ I 1^1 1^ 

166 ITWFKDFLi^^ASNGRIK-QLRS--ETFESTPIRGALQIESS^MR3KYECVATNSA 222 



61 G-QVEAPASLTV 71 

I ^ ^\\-\ I 
223 GVRYSSPANLYV 234 



A ClustalW analysis comparing the protein of the invention with related protein 
sequences is given in Table IK, v^th FCTRl shown on line 2. In the ClustalW alignment of the 
FCTRl protein, as well as all other ClustalW analyses herein, the black outlined amino acid 
residues indicate regions of conserved sequence (i.e., regions that may be required to preserve 
structural or functional properties), whereas non-highlighted amino acid residues are less 
conserved and can potentially be mutated to a much broader extent without altering protein 
structure or function. 



Table IK, ClustalW Analysis of FCTRl 

1) Q07822 MAC25 PROTEIN. (SEQ ID NO:40) 

2) Q16270 PROSTACYCLIN- STIMULATING FACTOR. (SEQ ID NO: 42) 
3)Q61581_F0LLISTATIN-LIKE 2: FOLLISTATIN-LIKE 2 (FOLLISTATIN-LIKE PROTEIN) 

ID N0:39) 

4) BAA21725 IGFBP-LIKE PROTEIN (SEQ ID NO: 38) 

5) FCTRl (SEQ ID NO: 2) 

6) B40098 COLORECTAL CANCER SUPPRESSOR DCC - RAT (FRAGMENTS) (SEQ ID NO: 43) 



(SEQ 



Q07822 

Q16270 

Q61581_ 

BAA21725 

FCTRl 

B40098 

Q07822 

Q16270 

Q61581_ 

BAA21725 

FCTRl 

B40098 

Q07822 

Q16270 

Q61581_ 

BAA21725 

FCTRl 

B40098 

Q07822 

Q16270 

Q61581_ 

BAA21725 

FCTRl 

B40098 

Q07822 

Q16270 

Q61581_ 

BAA21725 

FCTRl 

B40098 




PASraPLPHLGCLLGETgl 
PAsBaPLPgLGCLLGETlSi 

QDRggAPSgCPAPW I SAjgg 





p1rflsqtes|t 




gHKARDg 

|FMgDgVLLKgEgl|DPMPTIHWQKNQQDLTPNPGDSRVWPj|WFlNHgSN^AYESMD| 




KgGgG^ 

^ .YSSVPgg 
JItPDQLNSTGI PQLRSLNLVPEEEAESEgND© 

Bpkp * 
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Q07822 

Q16270 

Q61581_ 

BAA21725 

FCTRl 



IGFBP is expressed in nenrostem cell and developing central nervous system. MAC-25, a 
follistatin like protein is a growth suppressor of osteosarcoma cells, and meningiomas. DCC is 
expressed in most normal tissues especially in colonic mucosa, but is deleted in colorectal 
10 cancers. 

Since FCTRl has similarity to these proteins (shown in BlastP, Tables IC-IJ, and in 
clustalW, Table IK) it is likely that it has similar function. Therefore FCTRl could function as 
on or more of the follovmig: a tumor suppressor geneor regulator of neurological system 
development. 

1 5 Based on the protein similarity and tissue expression, FCTRl may be useful in the 

following diseases and uses: 

(i) Tissue regeneration in vitro and in vivo 

(ii) Neurological disorders, neurodegenerative disorders, nerve trauma 

(iii) Reproductive health 

20 (iv) Immunological disorders, allergy and infection 

(v) In cancer as a diagnostic and prognostic marker, as well as a protein therapeutic 

FCTR2 

FCTR2 (alternatively referred to herein as AC012614_1.0.123), is a growth factor 
bearing sequence similarity to human KIAA1061 protein and to genes involved in neuronal 

25 development and reproductive physiology (e.g., cell adhesion molecules, follistatin, roundabout 
and fi-azzled). FCTR2 is a full-length clone of 5502 nucleotides, including the entire coding 
sequence of a 815 amino acid protein. This sequence is expressed in glioma, osteoblast, other 
cancer cells, lung carcinoma, small intestine (This sequence maps to Unigene Hs. 123420 which 
is expressed in brain, breast, kidney, pancreas, pooled tissue). 

30 A FCTR2 ORF begins with an ATG initiation codon at nucleotides 420-422 and ends 

with a TGA codon at nucleotides 2865-2867. Putative untranslated regions upstream from the 
initiation codon and dovmstream from the termination codon are underlined in Table 2A, and the 
start and stop codons are in bold letters. 

Table 2A. FCTR2 Nucleotide Sequence (SEQ ID NO:3). 

35 CAATTTCACACAGGAAACAGCTATGCCATGATTACGCAAGTTGGTACCGAGCTCGGATCCACTAGTAACGGCCGCCA 

TGCTGGAATTCGGCTTACTCACTATAGGGCTCGAGCGGCTGCCCGGGCAGGTCATTAATTCCATTTCTTTTTAGAGTATC 
ACAGCTTTCTCCTTCACTGACCACCCTTTGCTTCCTGTCAGAAAGCCCTGGACAGAACTCTCTGTGGGATTCTGCCCATG 
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TTTCTGAGATATCGCCTCAATTqTCCTGGCTGGGCTGTCGGGTCTGCCCGTTTTACAGATCGGCAAACTGGAGTGGGAAG 
TATCCGGGTGGCTTCC Trr PPiJ^ Mi III TGGAGCAGCTACTGAAACAATCAGg 3^KaG AAGCTTTGAAGTCAC^ 

AGAAGAGAAGACTCca= 7vriTl^y'rTrriiTrTTrrTrriTrrrirrr n i i i i i i i i n i i i i i n i i 

gctgccctgacctctttgggctttccagccgcaacgagctgctggcctcctgcgggaagaagttctgcagccgagggagc 

cggtgcgtgctcagcaggaagacaggggagcccgaatgccagtgcctggaggcatga^ggcccagctacgt^ 

CGGCTCTGATGGGAGGTTTTATGAAAACCACTGTAAGCTCCACCGTGCTGCTTGCCTCCTGGGAAAGAGGATC^ 

tccacagcaaggactgtttcctcaaaggtgacacgtgcaccatggccggctacgcccgcttgaagaatgtccttct^ 

ctccagacccgtctgcagccactccaagaaggagacagcagacaagaccctgcctcccagaagcgcctcctggtggaatc 

tctgttcagggacttagatgcagatggcaatggccacctcagcagctccgaactggctcagcatgtgctgaagaa 

acctggatgaagacttacttggttgctcaccaggtgacctcctccgatttgacgattacaacagtgacagctccctgacc 

ctccgcgagttctacatggccttccaagtggttcagctcagcctcgcccccgaggacagggtcagtgtgaccacagtgac 

CGTGGGGCTGAGCACAGTGCTGACCTGCGCCGTCCATGGAGACCTGAGGCCACCAATCATCTGGAAGCGCAACGGGCT^ 

CCCTGAACTTCCTGGACTTGGAAGACATCAATGACTTTGGAGAGGATGATTCCCTGTACATCACCAAGGTGACC^ 

CACATGGGCTU^TTACACCTGCCATGCTTCCGGCCACGAGCAGCTGTTCCAGACCCACGTCCTGCAGG^^ 

agtcatccgtgtctatccagagagccaggcacaggagcctggagtggcagccagcctaagatgccatgctgagggcattc 

ccatgcccagaatcacttggctgaaaaacggcgtggatgtctcaactcagatgtccaaacagctctccct 

gggagcgaactccacatcagcagtgttcggtatgaagacacaggggcatacacctgcattgccaaaaatg^ 

ggatgaagatatctcctcgctcttcattgaagactcagctagaaagacccttgcaaacatcctgtggcgagaggaaggcc 

tcagcgtgggaaactvtgttctatgtcttctccgacgacggtatctvtcgtcatccatcctgtggactgtgagatccaga^ 

CACCTCAAACCCACGGAAAAGATTTTCATGAGCTATGAAGAAATCTGTCCTCAAAGAGAAAAAAATGCAACCC^^ 

ccagtgggtatctgcagtcaatgtccggaaccggtacatctatgtggcccagccagcactgagcagagtccttgtggtcg 
acatccaagcccagaaagtcctacagtccataggtgtggaccctctgccggctaagctgtcctatgacaagtcacatgac 
caagtgtgggtcctgagctggggggacgtgcacaagtcccgaccaagtctccaggtgatcacagaagccag 

GAGCCAGCACCTCT^TCCGCACACCCTTTGCAGGAGTGGATGATTTCTTCATTCCCCCAACAAACCTCATC^ 

TCAGGTTTGGCTTCATCTTCT^CAAGTCTGATCCTGCAGTCCACAAGGTGGACCTGGAAACT^TGA^^ 

ATCGGCCTGCT^CCT^CCATGGCTGCGTGCCCCAGGCCATGGCACACACCCT^CCTGGGCGGCTACTTCTTCA 

ACAGGACAGCCCCGCCTCTGCTGCCCGACAGCTGCTCGTTGACAGTGTCACAGACTCTGTGCTTGGCCCCAATGGTGATG 

TAACAGGCACCCCACACACATCCCCCGACGGGCGCTTCATAGTCT^GTGCTGCAGCTGACAGCCCCTGGCTG 

GAGATCACAGTGCGGGGCGAGATCCAGACCCTGTATGACCTGCAAATAAACTCGGGCATCTCT^GACTTGGCCTTCCAGCG 

CTCCTTCACTGAAAGCAATCAATACAACATCTACGCGGCTCTGCTVCACGGAGCCGGACCTGCTGTTCCTGGAGCTGTCC^ 

cggggaaggtgggct^tgctgaagaacttaaaggagccacccgcagggcct^gctcagccctgggggggtaccctvcagaat 

atgagggacagtgggctgtttggacagtacctcctcacaccagcccgagagtcactgttcctcatcaatgggagacaa 

cacgctgcggtgtgaggtgtcaggtataaagggggggaccacagtggtgtgggtgggtgaggtatg aagggccc^ 

gagccctgggcct^ggaacaccccctagtcctgacactgcagcctcaagmggtacgctgtacattttta^ 

caaaaacctgtactcgctttgtggttcaacactggtctccttgcaagtttcctagtataaggtatgcgctgctaccaaga 

TTGGGGTTTTTTCGTTAGGAAGTATGATTTATGCCTTGAGCTACGATGAGAACATATGCTGCTGTGTAAAGGGATCATTT 

CTGTGCCAAGCTGCACACCGAGTGACCTGGGGACATCATGGAACCAAGGGATCCTGCTCTCCAAGC^ 

GTTGCCTTCACATAGTCATTGTCCCTTACTGCCAGACCCAGCCAGACTTTGCCCTGACGGAGTGGCCCGGAAGCAGAGGC 

CGACCAGGAGCAGGGGCCTCCCTCCCGAACTGAAAGCCCATCCGTCCTCGCGTGGGACCGCATCTTCTCCCTCGCAGCTG 

CTTCTTGCTTTTCTTTCCATTTGACTTGCTGTAAGCCTGAGGGAGAGCCAACAAGACTTACTGCATCTTGGGGGATGGGG 

AAATCACTCACTTTATTTTGGAAATTTTTGATTAAAAAAAAATTTTATAATCTCAAATGCTAGTAAGCAGAAAGATGCTC 

TCCGAGGTCCAACTATATCCTTCCCTGCCTTAGGCCGAGTCTCGGGGGTGGTCACAACCCCACATCCCACAGCCAGA^ 

AACAATGGTCATCTGAGAATACTGGCCCTGTCGACTATTGCCT^CCCTGCTTCTCCAAGAGCAGACCTVGGCCACCTCA 

GTAAGGACTCGGTTCTGTGTTGGGACCCCAAAAAACCAGAACAAGTTCTGTGTGCCTCCTTTCAGCACAGAAGGGAGACA 

TCTCATTAGTCAGGTCTGGTACCCCTVGATTCAGGGCAGACTGGGCTTGCCTGGCAAGGTATGGGTGGCCTCCAGGCTCT^ 

TGCAGAAACCCCAAGGACACGAGTGGGGCCAGGTGAGTTCCTGAAGCTATACCTTTTCAAAACAGATTTTGTTTTCCTAC 

CTGTGGCCCATCCACTCCTCTCTGGTACCCCATCCCCG(:a.TCAGCACTGCAGAGAGAACACATTTCGGCGAGGGTTTTC 

TACCCACATTCCCCAATCAATACACACACACTGCAGAACCCAGAACAGAAGGCCAC^ 

TATGTGTCTCAGGCTGTGGTGACTCTCACATGGGCATCGAAGAAGTACAACCCACATAGCCCTCTGGAGACCGCCTAGAT 

CAGAGACTCAGCAAAAACAGGCTCGCCTTCCCTCTCCCACATATGAGTGGAACTTACATGTGTCCTGGTTTGAATC 

TTTTGCTVAGCCT^CACGGGTTGGGAGAGGTGGTCTCACCACAGACGTCTTTGCTAATTTGGCCACCTTCACCTA 

GACCAGGATTTTCCTTTGCCATTAAGGAATGAACTCTTTCAAGGAGAGGAAACCCTAGACTCTGTGTCACTCTCAACAC^ 

CACAGCTCCTTTCACTCCTGCCTGACTGCCAAGCCACCTGCATCCCCCGCCCCAGATCTCATGAGATCAATCACTTC 

GTCTCACGCAACTTGGTCCACCAAACGCCTGTCCCCTGTAACTCCTAGGGGTGCGCCTAGACAGGTACGTCTGTTTTTTA 

TTTTAAAAGATATGCTATGTAGATATAAGTTGAGGAAGCTCACCTCAAAAGCCTAGAATGCAGTTTCACAGTAGCTGGG^ 

TGCATGGATGACCCATCTCACCCCTTTTTTTTTCCTGCCTCAATATCTTGATATGTTATGTTTACfCCCAATCTCCCATT 

TTTACa^CTAAAATTCTCCAACTTTCATAAACTTTTTTTTGGAAAAATTTCCATTGTATCAGCCCCTGACAGAAAAAGG 

TCTCTGAGCCTAAAGGAGGAAAAGTCCCACCAACTACCAGACCAGAACACGAGCCCCTCTGGGCAGCAGGATTCCTAAGT 

CAAAGACCAGTTTGACCCAAACTGGCCTTTTAAAATAATCAGGAGTGACAGAGTCAACTTCTGCAGCACCTGCT^ 

CCACTGTCCCTTCCATCTTGGAATGTGTCTAAAAAAGCATAGCTGCCCTTTGCTGTCCTCAGAGTGCATTTCCTGGAGAC 

GGCAGGCTTAGGTCTCACTGACAGCATGCCAGACACAACTGj^TCGAAGCAGGCCTGAAGCCTAGGTCAGGGTTTCAC^ 

GTCCAGCCCCAGGAGGCAAAGTCACCAATGCAGGGAGGTAAATGCCTTTTGGCAGGAAAACCAATAGAGTTGGTTGGGTG 

GGGAGTCAGGGGTGGGAGGAGAAGGAGGAAGAGGAGGAAGGCCAGACTGGCCTGCCCTTTCTCCCATACTTCACCCCAGC 

AGAGGTTCATGGGACACAGTTGGAAAGCCACTGGGAGGAAATGCCTCACTACAGGGGGGCCTC^ 

GGTAATCCTCCTAATGAACCCACAAGGTCAATTCACAACTGATATCTTAGCTATTAAAGAAGTACTGACTTTACCAAAAG 
AATCATCAAGAAAGCTATTTATATAAACCCCCTCAGTCATTTTGAAATAAAATTAATTTTAC 
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The predicted amu^acid sequence of FCTR2 protein conaaDonding to the foregoing 
nucleotide sequence is rqBRed in Table 2B. FCTR2 was searcheHfainst other databases using 
SignalPep and PSort search protocols. The protein is most likely located in the mitochondrial 
matrix space (certainty=0.4718) and seems to have no N-terminal signal sequence. The predicted 
5 molecular weight is 90346.9 Daltons. 



Table 2B. FCTR2 Protein Sequence (SEQ ID NO:4). 

MQCDVGDGRLFRLSLKRALSSCPDLFGLSSRNELLASCGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYE^^ 
LHRAACLLGKRITVIHSKDCFLKGDTCTMAGYARLKNVLIJU^QTRLQPLQEGDSRQDPASQKRLLVESLFR^ 

1 0 AQHVLKKQDLDEDLI^CSPGDLLRFDDYNSDSSLTLREFYMAFQWQLSLAPEDRVSVTTVTVGLSTVLTCAW 

GLTmFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGHEQLFQTHVLQVNVPPVIRVYPESQAQEPGVAASL^ 
ITWLKNGVDVSTQMSKQLSLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSARKTLANILWREEGLSVGN^ 
SDIXJIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQREKNATQPCQWVSAVNVRNRYIYVAQPALSRVLVVDIQAQ^ 
PAKLSYDKSHDQVWVLSWGDVHKSRPSLQVITEASTGQSQHLIRTPFAGVDDFFIPPTNLIINHIRFGFIFNKSDPAVHKV^ 

15 MPLKTIGLHHHGOTPQAMAHTHI^YFFIQCRQDSPASAARQLLVDSVTDSVLGPNGDVTGTPHTSPDGRFIVSAAADSPW 

ITVRGEIQTLYDLQINSGISDIAFQRSFTESNQYNIYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWGGTHRIMRDSGLF 
GQYLLTPARESLFLINGRQNTLRCEVSGIKGGTTWWVGEV 



In a BLASTN search it was also found that nucleotides 784-5502 of FCTR2 nucleic acid 
20 had 4672 of 4719 bases (99%) identical to Homo sapiens mRNA for KIAA1061 protein, partial 
cds (GenBank Acc:AB028984) (Table 2C). 



25 



Table 2C. BLASTN of FCTR2 2ig2i\n%t Homo sapiens mRNA for KIAA1061 protein (SEQ 

ID NO:46) 

> GI I 5689458 I DBJI AB028984 . 1 ( AB028984 HOMO SAPIENS MRNA FOR KIAA1061 PROTEIN, PARTIAL 
CDS 

LENGTH = 4719 



30 



SCORE = 9075 BITS (4578), EXPECT = 0.0 
IDENTITIES = 4672/4719 (99%) 
STRAND = PLUS / PLUS 



35 



40 



45 



50 



55 



QUERY: 


784 


SB JCT : 


1 


QUERY: 


844 


SBJCT : 


61 


QUERY: 


904 


SBJCT : 


121 


QUERY: 


964 


SBJCT: 


181 


QUERY: 


1024 


SBJCT : 


241 


QUERY: 


1084 


SBJCT : 


301 



IIIIIIIIIIIIIIIMIIIIIIIIIIIIIillllllllllllllllllllllllllMI 

AGAATGTCCTTCTGGCACTCCAGACCCGTCTGCAGCCACTCCAAGAAGGAGACAGCAGAC 6 0 
AAGACCCTGCCTCCCAGAAGCGCCTCCTGGTGGAATCTCTGTTCAGGGACTTAGATGCAG 903 

IIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIillMIIIIIIIIIIIIIIIIII 

AAGACCCTGCCTCCCAGAAGCGCCTCCTGGTGGAATCTCTGTTCAGGGACTTAGATGCAG 120 



ATGGCAATGGCCACCTCAGCAGCTCCGAACTGGCTCAGCATGTGCTGAAGAAGCAGGACC 

lllllllllllllllllllllilllllilllllllllMIIIIIIIIIIIIIIIIIIIII 
ATGGCAATGGCCACCTCAGCAGCTCCGAACTGGCTCAGCATGTGCTGAAGAAGCAGGACC 



963 



180 



TGGATGAAGACTTACTTGGTTGCTCACCAGGTGACCTCCTCCGATTTGACGATTACAACA 1023 

llllllllillMIIIIMMIIIIIIMIIIilMIIIIIIIIIIIMIIIIIIIIIII 

TGGATGAAGACTTACTTGGTTGCTCACCAGGTGACCTCCTCCGATTTGACGATTACAACA 240 
GTGACAGCTCCCTGACCCTCCGCGAGTTCTACATGGCCTTCCAAGTGGTTCAGCTCAGCC 1083 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIMI 

GTGACAGCTCCCTGACCCTCCGCGAGTTCTACATGGCCTTCCAAGTGGTTCAGCTCAGCC 300 
TCGCCCCCGAGGACAGGGTCAGTGTGACCACAGTGACCGTGGGGCTGAGCACAGTGCTGA 1143 

lilllllllllMMIIIIMIIMIIIIIIIIMMIIIIIIIIIIIIIIIIIIillli 

TCGCCCCCGAGGACAGGGTCAGTGTGACCACAGTGACCGTGGGGCTGAGCACAGTGCTGA 360 
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QUERY: 


1144 


SB JCT : 


361 


QUERY: 


1204 


SB JCT : 


421 


QUERY: 


1264 


SB JCT : 


481 


QUERY: 


1324 


SB JCT : 


541 


QUERY: 


1384 


SB JCT : 


601 


QUERY: 


1444 


SB JCT : 


661 


QUERY: 


1504 


SBJCT : 


721 


QUERY: 


1564 


SBJCT : 


781 


QUERY: 


1624 


SBJCT : 


841 


QUERY: 


1684 


SBJCT : 


901 


QUERY: 


1744 


SBJCT : 


961 


QUERY: 


1804 


SBJCT : 


1021 


QUERY: 


1864 


SBJCT : 


1081 


QUERY: 


1924 


SBJCT : 


1141 


QUERY: 


1984 


SBJCT: 


1201 


QUERY: 


2044 


SBJCT: 


1261 


QUERY: 


2104 


SBJCT : 


1321 


QUERY: 


2164 



CCTGCGCCGTCCATGGAGACCTGAGGCCACCAATCATCTGGAAGCGf 

llllllllli^llMIMIIIIIIMIMIIIIilllll 

CCTGCGCCG^^fcCGAGACCTGAGGCCACCAATCATCTGGAAGi 



CGCAAO 



CGGGCTCACCC 1203 

Illlllilll 
IGGGCTCACCC 420 



TGAACTTCCTGGACTTGGAAGACATCAATGACTTTGGAGAGGATGATTCCCTGTACATCA 1263 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIII 

TGAACTTCCTGGACTTGGAAGACATCAATGACTTTGGAGAGGATGATTCCCTGTACATCA 4 80 

CCAAGGTGACCACCATCCACATGGGCAATTACACCTGCCATGCTTCCGGCCACGAGCAGC 1323 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIII 
CCAAGGTGACCACCATCCACATGGGCAATTACACCTGCCATGCTTCCGGCCACGAGCAGC 54 0 

TGTTCCAGACCCACGTCCTGCAGGTGAATGTGCCGCCAGTCATCCGTGTCTATCCAGAGA 13 83 

Mill llllll II II 1 1 III Mill MM III III llllll Mill II II III II Mill 

TGTTCCAGACCCACGTCCTGCAGGTGAATGTGCCGCCAGTCATCCGTGTCTATCCAGAGA 600 
GCCAGGCACAGGAGCCTGGAGTGGCAGCCAGCCTAAGATGCCATGCTGAGGGCATTCCCA 1443 

IIIMIIIIIIIIIIIIIIIIMIIIIMIIIMIIIIIIIIIIIIIIIIIIIMIIIM 

GCCAGGCACAGGAGCCTGGAGTGGCAGCCAGCCTAAGATGCCATGCTGAGGGCATTCCCA 660 

TGCCCAGAATCACTTGGCTGAAAAACGGCGTGGATGTCTCAACTCAGATGTCCAAACAGC 1503 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIMII 
TGCCCAGAATCACTTGGCTGAAAAACGGCGTGGATGTCTCAACTCAGATGTCCAAACAGC 720 

TCTCCCTTTTAGCCAATGGGAGCGAACTCCACATCAGCAGTGTTCGGTATGAAGACACAG 1563 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIII II llllllllllll 
TCTCCCTTTTAGCCAATGGGAGCGAACTCCACATCAGCAGTGTTCGGTATGAAGACACAG 780 

GGGCATACACCTGCATTGCCAAAAATGAAGTGGGTGTGGATGAAGATATCTCCTCGCTCT 1623 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GGGCATACACCTGCATTGCCAAAAATGAAGTGGGTGTGGATGAAGATATCTCCTCGCTCT 840 

TCATTGAAGACTCAGCTAGAAAGACCCTTGCAAACATCCTGTGGCGAGAGGAAGGCCTCA 1683 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIII 

TCATTGAAGACTCAGCTAGAAAGACCCTTGCAAACATCCTGTGGCGAGAGGAAGGCCTCA 900 
GCGTGGGAAACATGTTCTATGTCTTCTCCGACGACGGTATCATCGTCATCCATCCTGTGG 1743 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMII llllllllllll 

GCGTGGGAAACATGTTCTATGTCTTCTCCGACGACGGTATCATCGTCATCCATCCTGTGG 960 
ACTGTGAGATCCAGAGGCACCTCAAACCCACGGAAAAGATTTTCATGAGCTATGAAGAAA 1803 

IIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIII 

ACTGTGAGATCCAGAGGCACCTCAAACCCACGGAAAAGATTTTCATGAGCTATGAAGAAA 1020 

TCTGTCCTCAAAGAGNNNNNNNTGCAACCCAGCCCTGCCAGTGGGTATCTGCAGTCAATG 1863 

IIIIIIMIIIIIII IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 
TCTGTCCTCAAAGAGAAAAAAATGCAACCCAGCCCTGCCAGTGGGTATCTGCAGTCAATG 1080 

TCCGGAACCGGTACATCTATGTGGCCCAGCCAGCACTGAGCAGAGTCCTTGTGGTCGACA 1923 

Mill IIMIIII llllllllllll llllll Mill llllllllll II Mllllllllll 

TCCGGAACCGGTACATCTATGTGGCCCAGCCAGCACTGAGCAGAGTCCTTGTGGTCGACA 1140 
TCCAAGCCCAGAAAGTCCTACAGTCCATAGGTGTGGACCCTCTGCCGGCTAAGCTGTCCT 1983 

IIIIMIIilllMIIIIIIIIIM lllllllilMIIIIIIIIIIII llllllllllll 

TCCAAGCCCAGAAAGTCCTACAGTCCATAGGTGTGGACCCTCTGCCGGCTAAGCTGTCCT 1200 
ATGACAAGTCACATGACCAAGTGTGGGTCCTGAGCTGGGGGGACGTGCACAAGTCCCGAC 2043 

Mill Mill III Mill IIMIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

ATGACAAGTCACATGACCAAGTGTGGGTCCTGAGCTGGGGGGACGTGCACAAGTCCCGAC 1260 
CAAGTCTCCAGGTGATCACAGAAGCCAGCACCGGCCAGAGCCAGCACCTCATCCGCACAC 2103 

Mill Mill III Mill IIIIMIIIIIilllllllMIIIIIIIIIIIIIIIMI III 

CAAGTCTCCAGGTGATCACAGAAGCCAGCACCGGCCAGAGCCAGCACCTCATCCGCACAC 1320 
CCTTTGCAGGAGTGGATGATTTCTTCATTCCCCCAACAAACCTCATCATCAACCACATCA 2163 

llllllllllll MM I III Mill II II I II Mill I llllllllll llllllllllll 

CCTTTGCAGGAGTGGATGATTTCTTCATTCCCCCAACAAACCTCATCATCAACCACATCA 1380 
GGTTTGGCTTCATCTTCAACAAGTCTGATCCTGCAGTCCACAAGGTGGACCTGGAAACAA 2223 

IIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIII 
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SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT: 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY : 
SB JCT : 
QUERY: 
SB JCT : 



1381 GGTTTGGCTTCATCTTCAACAAGTCTGATCCTGCAGTCCACAAGGTGGACCTGGAAACAA 1440 



2224 TGATGCCO 
IIIIIIM 



:c^HbAi 

iitTTii 



CCATCGGCCTGCACCACCATGGCTGCGTGC 

MlilllMIIIIIIIMIIIIIIIIIIIM 




CCATGGCAC 22 83 

lilllllll 



1441 TGATGCCCCTCAAGACCATCGGCCTGCACCACCATGGCTGCGTGCCCCAGGCCATGGCAC 1500 
2284 ACACCCACCTGGGCGGCTACTTCTTCATCCAGTGCCGACAGGACAGCCCCGCCTCTGCTG 2343 

IIIIIIIIIMIMIMIMIIMIIIillllllllilllllllllllllllllllllll 

1501 ACACCCACCTGGGCGGCTACTTCTTCATCCAGTGCCGACAGGACAGCCCCGCCTCTGCTG 1560 
2344 CCCGACAGCTGCTCGTTGACAGTGTCACAGACTCTGTGCTTGGCCCCAATGGTGATGTAA 2403 

IIMIMIIMIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIII 

1561 CCCGACAGCTGCTCGTTGACAGTGTCACAGACTCTGTGCTTGGCCCCAATGGTGATGTAA 1620 
2404 CAGGCACCCCACACACATCCCCCGACGGGCGCTTCATAGTCAGTGCTGCAGCTGACAGCC 2463 

IIIIIIIIMIIIIMIilMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

1621 CAGGCACCCCACACACATCCCCCGACGGGCGCTTCATAGTCAGTGCTGCAGCTGACAGCC 1680 

2464 CCTGGCTGCACGTGCAGGAGATCACAGTGCGGGGCGAGATCCAGACCCTGTATGACCTGC 2523 

llllillllllllllllMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIilMII 
1681 CCTGGCTGCACGTGCAGGAGATCACAGTGCGGGGCGAGATCCAGACCCTGTATGACCTGC 1740 

2524 AAATAAACTCGGGCATCTCAGACTTGGCCTTCCAGCGCTCCTTCACTGAAAGCAATCAAT 2583 

1 1 1 1 M 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 

1741 AAATAAACTCGGGCATCTCAGACTTGGCCTTCCAGCGCTCCTTCACTGAAAGCAATCAAT 1800 

2584 ACAACATCTACGCGGCTCTGCACACGGAGCCGGACCTGCTGTTCCTGGAGCTGTCCACGG 2643 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIillllllllllllllllllllllllllllll 
1801 ACAACATCTACGCGGCTCTGCACACGGAGCCGGACCTGCTGTTCCTGGAGCTGTCCACGG 1860 

2644 GGAAGGTGGGCATGCTGAAGAACTTAAAGGAGCCACCCGCAGGGCCAGCTCAGCCCTNNN 2703 

llllllllllllllllllllllllllillllllllllllllllllllllllllllll 
1861 GGAAGGTGGGCATGCTGAAGAACTTAAAGGAGCCACCCGCAGGGCCAGCTCAGCCCTGGG 1920 

2704 NNNNTACCCACAGAATCATGAGGGACAGTGGGCTGTTTGGACAGTACCTCCTCACACCAG 2763 

Mllllllllllllllllllllllllllllllllilllllllllllllllllllll 
1921 GGGGTACCCACAGAATCATGAGGGACAGTGGGCTGTTTGGACAGTACCTCCTCACACCAG 1980 

2764 CCCGAGAGTCACTGTTCCTCATCAATGGGAGACAAAACACGCTGCGGTGTGAGGTGTCAG 2823 

IIIIIIIIIIIIIIIIIIIIIMIIIIilllllllllllllllllMIIIIIIIIIIIII 
1981 CCCGAGAGTCACTGTTCCTCATCAATGGGAGACAAAACACGCTGCGGTGTGAGGTGTCAG 204 0 

2824 GTATAAANNNNNNNACCACAGTGGTGTGGGTGGGTGAGGTATGAAGGGCCCAGAGCAGAG 2883 

lllllll lllllllllllllllllillllMIIIIIIIIIIIIIIIIIIIIII 
2041 GTATAAAGGGGGGGACCACAGTGGTGTGGGTGGGTGAGGTATGAAGGGCCCAGAGCAGAG 2100 

2 884 CCCTGGGCCAAGGAACACCCCCTAGTCCTGACACTGCAGCCTCAAGCAGGTACGCTGTAC 2 943 

IIIIIIMIIIIIIIIIIIIIIIIIIilllllllilllllllMIMIIIIIIMIIIII 

2101 CCCTGGGCCAAGGAACACCCCCTAGTCCTGACACTGCAGCCTCAAGCAGGTACGCTGTAC 2160 

2944 ATTTTTACAGACAAAAGCAAAAACCTGTACTCGCTTTGTGGTTCAACACTGGTCTCCTTG 3003 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIilllllllllllllllil 
2161 ATTTTTACAGACAAAAGCAAAAACCTGTACTCGCTTTGTGGTTCAACACTGGTCTCCTTG 2220 

3004 CAAGTTTCCTAGTATAAGGTATGCGCTGCTACCAAGATTGGGGTTTTTTCGTTAGGAAGT 3063 

MMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIillllMIIMIIIIIIIMIMM 

2221 CAAGTTTCCTAGTATAAGGTATGCGCTGCTACCAAGATTGGGGTTTTTTCGTTAGGAAGT 2280 
3064 ATGATTTATGCCTTGAGCTACGATGAGAACATATGCTGCTGTGTAAAGGGATCATTTCTG 3123 

IIIIIIIIIMMIIMMMIIIIIIIMIIIilllllllMlllllllllilllllll 

2281 ATGATTTATGCCTTGAGCTACGATGAGAACATATGCTGCTGTGTAAAGGGATCATTTCTG 2340 
3124 TGCCAAGCTGCACACCGAGTGACCTGGGGACATCATGGAACCAAGGGATCCTGCTCTCCA 3183 

IIIIIIMMIIIIIillllllilMMIIIIIIIIIIMIIIIIIIIiillllllllll 

2341 TGCCAAGCTGCACACCGAGTGACCTGGGGACATCATGGAACCAAGGGATCCTGCTCTCCA 2400 
3184 AGCAGACACCTCTGTCAGTTGCCTTCACATAGTCATTGTCCCTTACTGCCAGACCCAGCC 3243 

IIIMIiilMIIIIIIIIIMIIIIIIIIIIMIIMIIIMIIMIIIIIIilMlil 

2401 AGCAGACACCTCTGTCAGTTGCCTTCACATAGTCATTGTCCCTTACTGCCAGACCCAGCC 2460 
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3244 AGACTTTGCCCTGACGGAGTGGCCCGGAAGCAGAGGCCGACCAGGAG* 

IIMIIMIii^lllllllllllilMllllllillllllllil 

2461 AGACTTTGCl^^^CGGAGTGGCCCGGAAGCAGAGGCCGACCAGG. 



AGCAGGGGi 

rnmiw 

-^^GGGGi 



.GGGGCCTCCCT 3303 

iliiiM 

CCTCCCT 2520 



QUERY: 
SB JOT : 

QUERY: 3304 CCCGAACTGAAAGCCCATCCGTCCTCGCGTGGGACCGCATCTTCTCCCTCGCAGCTGCTT 33 63 

lllllillllllilllllllMllllliliMMIIIIIMMIIMIMIIIIilllll 

SBJCT: 2521 CCCGAACTGAAAGCCCATCCGTCCTCGCGTGGGACCGCATCTTCTCCCTCGCAGCTGCTT 2580 

QUERY: 3364 CTTGCTTTTCTTTCCATTTGACTTGCTGTAAGCCTGAGGGAGAGCCAACAAGACTTACTG 3423 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIMIIIIIIIilll 
SBJCT: 2581 CTTGCTTTTCTTTCCATTTGACTTGCTGTAAGCCTGAGGGAGAGCCAACAAGACTTACTG 2640 

QUERY: 3424 CATCTTGGGGGATGGGGAAATCACTCACTTTATTTTGGAAATTTTTGATTNNNNNNN^^ 3483 

IIIIIIIIIIMMIIilllllliMIIIMIMIIIIIMMIIIIIII I 

SBJCT: 2641 CATCTTGGGGGATGGGGAAATCACTCACTTTATTTTGGAAATTTTTGATTAAAAAAAAAT 2 700 
QUERY: 3484 TTTATAATCTCAAATGCTAGTAAGCAGAAAGATGCTCTCCGAGGTCCAACTATATCCTTC 3543 

IIIIIIIIIIIIIIIIMMIIMIMIIIMIIIIIIIIIIIIIIIMIIIIIIIIIII 

SBJCT: 2701 TTTATAATCTCAAATGCTAGTAAGCAGAAAGATGCTCTCCGAGGTCCAACTATATCCTTC 2 760 

QUERY: 3 544 CCTGCCTTAGGCCGAGTCTCGGGGGTGGTCACAACCCCACATCCCACAGCCAGAAAGAAC 3 603 

IIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMillil 
SBJCT: 2761 CCTGCCTTAGGCCGAGTCTCGGGGGTGGTCACAACCCCACATCCCACAGCCAGAAAGAAC 2 820 

QUERY: 3604 AATGGTCATCTGAGAATACTGGCCCTGTCGACTATTGCCACCCTGCTTCTCCAAGAGCAG 3 663 

MMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIII 
SBJCT: 2821 AATGGTCATCTGAGAATACTGGCCCTGTCGACTATTGCCACCCTGCTTCTCCAAGAGCAG 2880 

QUERY: 3 664 ACCAGGCCACCTCATCCGTAAGGACTCGGTTCTGTGTTGGGACCCCAAAAAACCAGAACA 3723 

IMIIIIIIIIIIIIIIIilllllllMIIIIIIIIIIIIIIIIMIIIIIIIIMIIM 

SBJCT : 2881 ACCAGGCCACCTCATCCGTAAGGACTCGGTTCTGTGTTGGGACCCCAAAAAACCAGAACA 294 0 

QUERY: 3724 AGTTCTGTGTGCCTCCTTTCAGCACAGAAGGGAGACATCTCATTAGTCAGGTCTGGTACC 3783 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
SBJCT: 2941 AGTTCTGTGTGCCTCCTTTCAGCACAGAAGGGAGACATCTCATTAGTCAGGTCTGGTACC 3000 

QUERY: 3784 CCAGATTCAGGGCAGACTGGGCTTGCCTGGCAAGGTATGGGTGGCCTCCAGGCTCAATGC 3 843 

MllllMlllllMlilllllllllilllllllllllllllllillllllllilillll 

SBJCT: 3001 CCAGATTCAGGGCAGACTGGGCTTGCCTGGCAAGGTATGGGTGGCCTCCAGGCTCAATGC 3060 

QUERY: 3 844 AGAAACCCCAAGGACACGAGTGGGGCCAGGTGAGTTCCTGAAGCTATACCTTTTCAAAAC 3 903 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIM 
SBJCT: 3061 AGAAACCCCAAGGACACGAGTGGGGCCAGGTGAGTTCCTGAAGCTATACCTTTTCAAAAC 3120 

QUERY : 3 904 AGATTTTGTTTTCCTACCTGTGGCCCATCCACTCCTCTCTGGTACCCCATCCCCGCATCA 3 963 

lllllllllllillllllllllllllllllllllllllllllllllllllllllllllM 
SBJCT: 3121 AGATTTTGTTTTCCTACCTGTGGCCCATCCACTCCTCTCTGGTACCCCATCCCCGCATCA 3180 

QUERY : 3 964 GCACTGCAGAGAGAACACATTTCGGCGAGGGTTTTCTTACCCACATTCCCCAATCAATAC 4 023 

MIMIMIIIIIIIIIIIIIIMIIIMMIMIIIIMMIIIIMIIIIIIIIIIil 

SBJCT: 3181 GCACTGCAGAGAGAACACATTTCGGCGAGGGTTTTCTTACCCACATTCCCCAATCAATAC 3 240 

QUERY: 4024 ACACACACTGCAGAACCCAGAACAGAAGGCCACAGGCTGGCACTACTGCATTCTCCTTAT 4083 

lllllllllllillllllllMllllillllllllllllllllllllMIIIIIIIIIM 
SBJCT: 3241 ACACACACTGCAGAACCCAGAACAGAAGGCCACAGGCTGGCACTACTGCATTCTCCTTAT 3300 

QUERY: 4084 GTGTCTCAGGCTGTGGTGACTCTCACATGGGCATCGAAGAAGTACAACCCACATAGCCCT 4143 

IMiillllllllllllllllllllMllllllllllMillllllilllllllilllll 

SBJCT: 3301 GTGTCTCAGGCTGTGGTGACTCTCACATGGGCATCGAAGAAGTACAACCCACATAGCCCT 3360 

QUERY: 4144 CTGGAGACCGCCTAGATCAGAGACTCAGCAAAAACAGGCTCGCCTTCCCTCTCCCACATA 4203 

IIIMIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIilllllllllllllllllllli 
SBJCT: 3361 CTGGAGACCGCCTAGATCAGAGACTCAGCAAAAACAGGCTCGCCTTCCCTCTCCCACATA 3420 

QUERY: 4204 TGAGTGGAACrTACATGTGTCCTGGTTTGAATGATCATTTTGCAAGCCACACGGGTTGGG 4 263 

MllillllllllllllllllllllllllllllllllllllllllllllllllMIIIII 

SBJCT: 3421 TGAGTGGAACTTACATGTGTCCTGGTTTGAATGATCATTTTGCAAGCCACACGGGTTGGG 34 80 

QUERY: 4264 AGAGGTGGTCTCACCACAGACGTCTTTGCTAATTTGGCCACCTTCACCTACTGACATGAC 4323 
lllllllllllllllllllllllllllllllllillllllllllllllllllllllllM 
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SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SBJCT : 
QUERY : 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY : 
SBJCT : 
QUERY : 
SBJCT : 
QUERY : 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY : 
SBJCT : 
QUERY: 
SBJCT : 



34 81 AGAGGTGGTCTCACCACAGACGTCTTTGCTAATTTGGCCACCTTCACCTACTGACATGAC 354 0 



4324 CAGGATTTT^^PrGCCATTAAGGAATGAACTCTTTCAAGGAGAGd^ptCCTAGACTCT 4383 

I i 1 1 1 1 1 1 1 itTTm 1 1 1 1 M 1 1 1 i M M 1 1 1 1 1 1 1 1 1 M I M 1 1 1 m 1 1 1 M 

3541 CAGGATTTTCCTTTGCCATTAAGGAATGAACTCTTTCAAGGAGAGGAAACCCTAGACTCT 3600 
43 84 GTGTCACTCTCAACACACACAGCTCCTTTCACTCCTGCCTGACTGCCAAGCCACCTGCAT 4443 

IIIIIIIIIIMIIIMIIIIIIIIMIIIIIIIIIIMIIIMIIIIMIIIIIIIIII 

3601 GTGTCACTCTCAACACACACAGCTCCTTTCACTCCTGCCTGACTGCCAAGCCACCTGCAT 3660 
4444 CCCCCGCCCCAGATCTCATGAGATCAATCACTTGTATGTCTCACGCAACTTGGTCCACCA 4503 

MIIIIIIIIMMIMIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIII 

3661 CCCCCGCCCCAGATCTCATGAGATCAATCACTTGTATGTCTCACGCAACTTGGTCCACCA 3 72 0 
4 504 AACGCCTGTCCCCTGTAACTCCTAGGGGTGCGCCTAGACAGGTACGTCTGTTTTTTATTT 4563 

llllllllliMllllllllllllllllllllllllllillillllllllllllllllll 

3721 AACGCCTGTCCCCTGTAACTCCTAGGGGTGCGCCTAGACAGGTACGTCTGTTTTTTATTT 3780 

4564 TAAAAGATATGCTATGTAGATATAAGTTGAGGAAGCTCACCTCAAAAGCCTAGAATGCAG 4623 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIillllllllllllllllllMI 
3781 TAAAAGATATGCTATGTAGATATAAGTTGAGGAAGCTCACCTCAAAAGCCTAGAATGCAG 384 0 

4624 TTTCACAGTAGCTGGGATGCATGGATGACCCATCTCACCCC^FNNNNNNNNCCTGCCTC^ 4683 

IlilllllMIIMIMIIIIIIIillllllllllllllll llllllllll 

3841 TTTCACAGTAGCTGGGATGCATGGATGACCCATCTCACCCCTTTTTTTTTCCTGCCTCAA 3 900 

4684 TATCTTGATATGTTATGTTTACTCCCAATCTCCCATTTTTACCACTAAAATTCTCCAACT 4 74 3 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 
3901 TATCTTGATATGTTATGTTTACTCCCAATCTCCCATTTTTACCACTAAAATTCTCCAACT 3 960 

4744 TTCATAAACNNNNNNNNGGAT^AAATTTCCATTGTATCAGCCCCTGACAGAAAAAGGATCT 4 803 

Illllllll IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 
3961 TTCATAAACTTTTTTTTGGAAAAATTTCCATTGTATCAGCCCCTGACAGAAAAAGGATCT 4 020 

4 804 CTGAGCCTAAAGGAGGAAAAGTCCCACCAACTACCAGACCAGAACACGAGCCCCTCTGGG 4 863 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
4021 CTGAGCCTAAAGGAGGAAAAGTCCCACCAACTACCAGACCAGAACACGAGCCCCTCTGGG 4080 

4 864 CAGCAGGATTCCTAAGTCAAAGACCAGTTTGACCCAAACTGGCCTTTTAAAATAATCAGG 4 923 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 
4 081 CAGCAGGATTCCTAAGTCAAAGACCAGTTTGACCCAAACTGGCCTTTTAAAATAATCAGG 414 0 

4 924 AGTGACAGAGTCAACTTCTGCAGCACCTGCTTCTCCCCCACTGTCCCTTCCATCTTGGAA 4 983 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIM Illllllll III IIIIIIIIIIIMII 

4141 AGTGACAGAGTCAACTTCTGCAGCACCTGCTTCTCCCCCACTGTCCCTTCCATCTTGGAA 4200 

4 984 TGTGTCTAAAAAAGCATAGCTGCCCTTTGCTGTCCTCAGAGTGCATTTCCTGGAGACGGC 504 3 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

42 01 TGTGTCTAAAAAAGCATAGCTGCCCTTTGCTGTCCTCAGAGTGCATTTCCTGGAGACGGC 4260 

5044 AGGCTTAGGTCTCACTGACAGCATGCCAGACACAACTGAATCGAAGCAGGCCTGAAGCCT 5103 

llilllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
4261 AGGCTTAGGTCTCACTGACAGCATGCCAGACACAACTGAATCGAAGCAGGCCTGAAGCCT 432 0 

5104 AGGTCAGGGTTTCAGGAGTCCAGCCCCAGGAGGCAAAGTCACCAATGCAGGGAGGTAAAT 5163 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIII 
4321 AGGTCAGGGTTTCAGGAGTCCAGCCCCAGGAGGCAAAGTCACCAATGCAGGGAGGTAAAT 4380 

5164 GCCTTTTGGCAGGAAAACCAATAGAGTTGGTTGGGTGGGGAGTCAGGGGTGGGAGGAGAA 5223 

llllllilMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

43 81 GCCTTTTGGCAGGAAAACCAATAGAGTTGGTTGGGTGGGGAGTCAGGGGTGGGAGGAGAA 444 0 
5224 GGAGGAAGAGGAGGAAGGCCAGACTGGCCTGCCCTTTCTCCCATACTTCACCCCAGCAGA 5283 

llllllllllllillllllllllllllllllllllllllillllllllllllllllllll 

4441 GGAGGAAGAGGAGGAAGGCCAGACTGGCCTGCCCTTTCTCCCATACTTCACCCCAGCAGA 4500 

5284 GGTTCATGGGACACAGTTGGAAAGCCACTGGGAGGAAATGCCTCACTACAGGGGGGCCTC 5343 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
4 501 GGTTCATGGGACACAGTTGGAAAGCCACTGGGAGGAAATGCCTCACTACAGGGGGGCCTC 4 560 



\G(I^^C 
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10 



QUERY: 


5344 


SB JCT : 


4561 


QUERY: 


5404 


SB JCT : 


4621 


QUERY: 


5464 


SB JCT: 


4681 



CTGTAGCAAGCCCAGCCGGTAATCCTCCTAATGAACCCACAAGGTi 

IIIIIIIIIMIIIIIIIIIIIIIIIIIMIMIIIIIIIII 

CTGTAGCAA^M^GCCGGTAATCCTCCTAATGAACCCACAAGGTi 



CAATTi 



3TCACAACTGAT 5403 

Illlllllll 
CACAACTGAT 4620 



5463 



ATCTTAGCTATTAAAGAAGTACTGACTTTACCAAAAGAATCATCAAGAAAGCTATTTATA 

lllllllllllllllilllllllllllllllllllllllllllillllllllllllllll 
ATCTTAGCTATTAAAGAAGTACTGACTTTACCAAAAGAATCATCAAGAAAGCTATTTATA 4680 

TAAACCCCCTCAGTCATTTTGAAATAAAATTAATTTTAC 5502 

lllllllllllllllllllllllllllllllllllllll 
TAAACCCCCTCAGTCATTTTGAAATAAAATTAATTTTAC 4 719 
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The FCTR2 amino acid sequence has 473 of 810 amino acid residues (58%) identical to, 
and 616 of 810 residues (76%) positive with, the 850 amino acid residue proteins from Homo 
sapiens KIAA1263 Protein fragment (ptnr: TREMBLNEW-ACC:BAA86577) (SEQ ID NO:47) 
(Table 2D). 

Table 2D. BLASTP of FCTR2 against Homo sapiens KIAA1263 Protein fragment (SEQ ID 

NO:47) 

ptnr:TREMBLNEW-ACC:BAA86577 K1AA1263 PROTEIN - Homo sapiens (Human), 850 aa 
(fragment) 

Length = 850 

Score = 2573 (905.7 bits). Expect = 2.0e-267, P = 2.0e-2S7 
Identities = 473/810 (58%), Positives = S16/810 (7S*) 



QUERY: 


10 


SB JCT : 


40 


QUERY: 


70 


SB JCT : 


100 


QUERY: 


130 


SB JCT : 


160 


QUERY: 


189 


SB JCT : 


219 


QUERY: 


249 


SB JCT: 


279 


QUERY: 


309 


SB JCT : 


339 


QUERY : 


369 


SB JCT: 


399 


QUERY: 


429 


SB JCT : 


459 


QUERY: 


489 


SB JCT : 


519 



LFRLSLKRALSSCPDLFGLSSRNELLASCGKKFCSRGSRCVLSRKTGEPECQCLEACRPS 

I II I + I II IH I II Ihik II h+ k 

LMRLRHKEKNQESSRVKGFMIQDGPFGSCENKYCGLGRHCVTSRETGQAECACMDLCKRH 



69 



99 



YVPVCGSDGRFYENHCKLHRAACLLGKRITVIHSKDCFLKGDTCTMAGYARLKNVLLALQ 129 

I IIIIIM lllllhHIIIII +^||+H++||| III I h^^thll It 

YKPVCGSDGEFYENHCEVHRAACLKKQKITIVHNEDCFFKGDKCKTTEYSKMKNMLLDLQ 159 
TRLQPLQEGDSRQ- DPASQKRLLVESLFRDLDADGNGHLSSSELAQHVLKKQDLDEDLLG 188 

+ +11 ^+ I hhllh +k III II + +111 I + I+++I +1! 

NQKYIMQENENPNGDDISRKKLLVDQMFKYFDADSNGLVDXNELTQ-VIKQEELGKDLFD 218 

CSPGDLLRFDDYNSDSSLTLREFYMAFQWQLSLAPEDRVSVTTVTVGLSTVLTCAVHGD 24 8 

1+ II++II+I+I I I III lill+llll + + I I III I II 11+ I 
CTLYVLLKYDDFNADKHLALEEFYRAFQVIQLSLPEDQKLSITAATVGQSAVLSCAIQGT 278 

LRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGHEQLFQTHVL 3 08 

Illlllllll + II llllllllll+l IMIIIIII l+lllll+l I+II++III+ 
LRPPIIWKRNNIILNNLDLEDINDFGDDGSLYITKVTTTHVGNYTCYADGYEQVYQTHIF 33 8 

QVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQLSLLANG 368 

llllllllllllllll+llll lllllllllll I++ IIIII+I++ ++IIII+I III 
QVNVPPVIRVYPESQAREPGVTASLRCHAEGIPKPQLGWLKNGIDITPKLSKQLTLQANG 398 

SELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSARKTLANILWREEGLSVGNMFY 428 

ll+lll+IIIIIIIIIIMIIM llllllllll+llllllllllllllllil +IIIII 
SEVHISNVRYEDTGAYTCIAKNEAGVDEDISSLFVEDSARKTLANILWREEGLGIGNMFY 4 58 

VFSDDGIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQREKNATQPCQWVSAVNVRNRYIY 488 

II +111 II I++II III+II+II+ +I+II+ I + I I I IIIII++++II 
VFYEDGIKVIQPIECE FQRH I KPS EKLLGFQDEVCPKAEGDEVQRCVWAS AVNVKDKF I Y 5 1 8 

VAQPALSRVLVVDIQAQKVLQSIGVDPLPAKLSYDKSHDQVWVLSWGDVHKSRPSLQVIT 548 

Mil I III + II + I + III + I++ M + l II llllllllllllll + 1+ l + lllll 
VAQPTLDRVL I VDVQSQKWQAVSTDPVPVKLHYDKSHDQVWVLSWGTLEKTS PTLQVIT 578 



QUERY: 54 9 EASTGQSQHLIRT PFAGVDDFFIPPTNLIINHIRFGFIFNKSDPAVHKVDLETMM 603 
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SB JCT : 


579 


QUERY: 


604 


SB JCT : 


639 


QUERY: 


664 


SB JCT : 


699 


QUERY: 


724 


SB JCT : 


759 


QUERY: 


784 


SB JCT : 


819 



II 

LASGNVPHHT, 



gWpq. 



I IIIIIM I III Mllll 
vgkqfdrvddffiptttliithmrfgfilh: 



I 



klllll 

QKIDLETMS 



638 



663 



P LKT I GLHHHGCvPQAMAHTHLGG YF F I QCRQDS PAS AARQLLVDS VTDS VLGPNGDVTG 

Hill ^ lltl^-^hlllllhll 1+ II + + kHI llllhl I MM 

YIKTINLKDYKCVPQSLAYTHLGGYYFIGCKPDSTGAVSPQVMVDGVTDSVIGFNSDVTG 698 
TPHTSPDGRFIVSAAADSPWLHVQEITVRGEIQTLYDLQINSGISDLAFQRSFTESNQYN 723 

Ih MM ^^11 ^ II IMIIIII I MIMII IIIM^III 

TPYVSPDGHYLVSINDVKGLVRVQYITIRGEIQEAFDIYTNLHISDLAFQPSFTEAHQYN 758 
lYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWGGTHRIMRDSGLFGQYLLTPAR 783 

II ^ 1^ MMIIMII hMIII II +1 +^IIMIMIhlM^ 

lYGSSSTQTDVLFVELSSGKVKMIKSLKEPLKAEEWPWNRKNRQIQDSGLFGQYLMTPSK 818 



ESLFLINGRQNTLRCEVSGI KGGTTWWVGE 

Hlk^-^ll I I IM^ I Ihllk 

DSLFILDGRLNKLNCEITEVEKGNTVIWVGD 



814 



849 



Amino acids 123-815 of FCTR2 also have 693 of 693 amino acid residues (100%) 
identical to,the 693 amino acid residue protein fragment of KIAA1061 Protein from Homo 
sapiens (ptnr: TREMBLNEW-ACC: BAA83013) (SEQ ID NO:48) (Table 2E). 



Table 2E. BLASTP of FCTR2 against KIAA1061 Protein [Fragment] (SEQ ID NO:48) 

ptnr : TREMBLNEW-ACC :BAA8 3 013 KIAA1061 PROTEIN - Homo sapiens (Human), 
693 aa (fragment) . 

Length = 693 

Score = 3623 (1275.4 bits). Expect = 0.0, P = 0.0 
Identities = 693/693 (100%), Positives = 693/693 (100%) 



QUERY: 


123 


SB JCT : 


1 


QUERY: 


183 


SB JCT: 


61 


QUERY: 


243 


SB JCT : 


121 


QUERY: 


303 


SB JCT : 


181 


QUERY: 


363 


SB JCT : 


241 


QUERY: 


423 


SB JCT : 


301 


QUERY: 


483 


SB JCT: 


361 


QUERY: 


543 


SB JCT : 


421 



NVLLALQTRLQPLQEGDSRQDPASQKRLLVESLFRDLDADGNGHLSSSELAQHVLKKQDL 182 

IIIIIIIIIIIIIMIIIIIIIIMIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIII 

NVLLALQTRLQPLQEGDSRQDPASQKRLLVESLFRDLDADGNGHLSSSELAQHVLKKQDL 6 0 
DEDLLGCSPGDLLRFDDYNSDSSLTLREFYMAFQWQLSLAPEDRVSVTTVTVGLSTVLT 242 

II Mill Mill I Mil III IMIil MM llllll II MM IIIIIM III II I II III 

DEDLLGCSPGDLLRFDDYNSDSSLTLREFYMAFQWQLSLAPEDRVSVTTVTVGLSTVLT 120 

CAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGHEQL 3 02 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIIII lllllllllll 
CAVHGDLR P P II WKRNGLTLNFLDLED INDFGEDDS L Y I TKVTT I HMGNYTCHASGHEQL 180 

FQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQL 362 

II Mill I MM I II Mill llllll IIIIIIIIM II MM M Mill III II I II III 

FQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQL 240 
SLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSARKTLANILWREEGLS 422 

MIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIMIillllllllllllll 

SLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSARKTLANILWREEGLS 300 

VGNMFYVFSDDGIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQREKNATQPCQWVSAVNV 4 82 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 
VGNMFYVFSDDGIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQREKNATQPCQWVSAVNV 360 

RNRYIYVAQPALSRVLWDIQAQKVLQSIGVDPLPAKLSYDKSHDQVWVLSWGDVHKSRP 542 

IIIIIIIMIilllllllllllllllllllllllllllllllllllllllllMIIIMI 

RNRYI YVAQPALSRVLWDIQAQKVLQS IGVDPLPAKLSYDKSHDQVWVLSWGDVHKSRP 420 
SLQVITEASTGQSQHLIRTPFAGVDDFFIPPTNLIINHIRFGFIFNKSDPAVHKVDLETM 602 

II III II I MM I llllll III MM Mill Mill II llllll III II I II MIMIII 

SLQVITEASTGQSQHLIRTPFAGVDDFFIPPTNLIINHIRFGFIFNKSDPAVHKVDLETM 4 80 
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QUERY: 


603 


SB JCT : 


481 


QUERY: 


663 


SB JCT : 


541 


QUERY: 


723 


SB JCT : 


601 


QUERY: 


783 


SB JCT: 


661 



MPLKTIGLHM^MpQAMAHTHLGGYFFIQCRQDSPASAARQLLVD^^^VLGPNGDVT 662 

MllllllllWlliillllNIIIIIIIIIIIIIIIIIMIIIlWllllllll^ 

MPLKTIGLHHHGCVPQAMAHTHLGGYFFIQCRQDSPASAARQLLVDSVT0SVLGPNGDVT 540 



GTPHTSPDGRFIVSAAADSPWLHVQEITVRGEIQTLYDLQINSGISDLAFQRSFTESNQY 

IIIMMIIIIIIIIMIIIIIIMIIIIIIIIIIIMIMIIIIIIillllllllMII 

GTPHTS PDGRF I VSAAADS P WLHVQE I TVRGE I QTLYDLQ INSG I S DLAFQRS FTESNQY 



722 



600 



782 



NIYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWGGTHRIMRDSGLFGQYLLTPA 

IIIIIIIIIIIIIIIIIIIMIIMMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 
NIYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWGGTHRIMRDSGLFGQYLLTPA 660 



815 



RESLFLINGRQNTLRCEVSGIKGGTTWWVGEV 

Mill 1 1 Mill 1 11 II MM Ml II II I INI 

RESLFLINGRQNTLRCEVSGIKGGTTWWVGEV 693 



The amino acid sequence of the FCTR2 protein has 451 of 772 amino acid residues 
(58%) identical to, and 586 of 772 residues (75%) positive with, the 773 amino acid residue 
proteins hypothetical protein DKFZp566D234.1 from Homo sapiens (fragments) (ptnr: 
SPTREMBL-ACC: CAB70877.1) (SEQ ID NO:49) (Table 2F). 

Table 2F. BLASTP of FCTR2 against hypothetical protein DKFZp566D234.1 (SEQ ID 

NO:49) 

> GI 1 11360192 I PIR| IT462B3 HYPOTHETICAL PROTEIN DKFZP566D234 . 1 - HUMAN (FRAGMENTS) 
GI I 6808053 I EMB I CAB708 77 . 1 I (AL137695) HYPOTHETICAL PROTEIN [HOMO SAPIENS] 
LENGTH = 773 

SCORE = 911 BITS (2354), EXPECT = 0.0 

IDENTITIES = 451/772 (58%), POSITIVES = 586/772 (75%), GAPS = 7/772 (0%) 

CVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKRITVIHSKDCFL 108 

II Ihik II k I lllllll lllllhHIIIII +-HII++I+HII 
CVTSRETGQAECACMDLCKRHYKPVCGSDGEFYENHCEVHRAACLKKQKITIVHNEDCFF 61 

KGDTCTMAGYARLKNVLLALQTRLQPLQEGDSRQ- DPASQKRLLVESLFRDLDADGNGHL 167 

III I ^-Hl^ll II + ^1! I kkllh +k III I + 

KGDKCKTTECSKMKNMLLDLQNQRYIMQENENPNGDDISRKKLLVDQMFKYFDADSNDLV 121 
SSSELAQHVLKKQDLDEDLLGCSPGDLLRFDDYNSDSSLTLREFYMAFQWQLSLAPEDR 227 

+ 11 I hk^-^i +11 1+ ll++lhl + l I I III llll + llll + + 

DINELTQ-VIKQEELGKDLFDCTLYVLLKYDDFNADKHLALEEFYRAFQVIQLSLPEDQK 180 

XXXXXXXXXXXXXXXCAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTI 287 

11+ I llllllllll + II I llllllll+l lilllllll 
LSITAATVGQSAVLSCAIQGTLRPPIIWKRNNIILNNLGLEDINDFGDDGSLYITKVTTT 240 

HMGNYTCHASGHEQLFQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITW 347 

l+lllll+l l+il++lll+ llllllllllllllll+llli lllllllllll I++ I 
HVGNYTCYADGYEQVYQTHIFQVNVPPVIRVYPESQAREPGVTASLRCHAEGIPKPQLGW 300 

LKNGVDVSTQMSKQLSLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSA 407 

IIII+I++ ++IIII+I lllll+lil+llllllllllllllll llllllllll+llll 
LKNGIDITPKLSKQLTLQANGSEVHISNVRYEDTGAYTCIAKNEAGVDEDISSLFVEDSA 360 

RKTLANILWREEGLSVGNMFYVFSDDGIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQRE 467 

llllllllllllll +IIIIIII +111 II I++II III+II+II+ +I+II I 
RKTLANI LWREEGLG IGNMFYVFYEDG I KVI QP I ECEFQRH I KPS EKLLGFQDEVCP I AE 420 

KNATQPCQWVSAVNVRNRYI YVAQPALSRVLWDIQAQKVLQS IGVDPLPAKLSYDKSHD 527 

+ 111 IIIII++++IIIIII I III+II+I+III+I++ ii+i II mill 

GDEVQRCVWASAVNVKDKFI YVAQPTLDRVLIVDVQSQKWQAVSTDPVPVKLHYDKSHD 4 80 



QUERY: 


49 


SB JCT : 


2 


QUERY: 


109 


SB JCT : 


62 


QUERY : 


168 


SB JCT : 


122 


QUERY: 


228 


SB JCT : 


181 


QUERY: 


288 


SB JCT : 


241 


QUERY: 


348 


SB JCT : 


301 


QUERY: 


408 


SB JCT : 


361 


QUERY: 


468 


SB JCT : 


421 
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f = ^ 



35 



QUERY: 528 QVWVLSWGDVHKSRPSLQVITEASTGQSQHLIRT PFAGVDDFFIPPTNLIINHIR 582 

IMIIIII ^Jlkl^lllll II III I lll|i|l III IH 

SBJCT: 4 81 QVWVLSWGTL^^PTLQVITLASGNVPHHTIHTQPVGKQFDRVDDI^Bh'TLI ITHMR 54 0 



5 QUERY: 583 FGFIFNKSDPAVHKVDLETMMPLKTIGLHHHGCVPQAMAHTHLGGYFFIQCRQDSPASAA 642 

MM ^1 Mllll HIM + lllhHHIIIIkll U II ^ + 

SBJCT: 541 FGFILHKDEAALQKIDLETMSYIKTINLKDYKCVPQSIAYTHLGGYYFIGCKPDSTGAVS 600 
QUERY: 643 RQLLVDSVTDSVLGPNGDVTGTPHTSPDGRFIVSAAADSPWLHVQEITVRGEIQTLYDLQ 702 

10 l+HI IMIM I lllllk MM +HI ^ II Ihlllll +k 

SBJCT: 601 PQVMVDGVTDSVIGFNSDVTGTPYVSPDGHYLVSINDVKGLVRVQYITIRGEIQEAFDIY 660 
QUERY: 703 INSGISDLAFQRSFTESNQYNIYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWG 762 

I IIIIMI lllkHIMI ^ k hllHIIHII IHHIII II 

15 SBJCT: 661 TNLHISDLAFQPSFTEAHQYNIYGSSSTQTDVLFVELSSGKVKMIKSLKEPLKAEEWPWN 720 
QUERY: 763 GTHRIMRDSGLFGQYLLTPARESLFLINGRQNTLRCEVSGIKGGTTWWVGE 814 

H ^HIMMIIhlh+Hlh+HI I I Ih-^ +M MHIK 

SBJCT: 721 RKNRQIQDSGLFGQYLMTPSKDSLFILDGRLNKLNCEITEVEKGNTVIWVGD 772 

20 

The amino acid sequence of the FCTR2 protein has 61 of 194 amino acid residues (31%) 
identical to, and 90 of 194 residues (45%) positive with, the 306 amino acid residue protein 
Follastin-Related Protein 1 Precursor from Rattus Norvegicus (ptnr: GenBank Acc:Q62632) 
(SEQ ID NO:50) (Table 2G). 

sa 25 Table 2G. BLAST? of FCTR2 against Follastatin-Related Protein 1 Precursor from Rattus 

m 

n Norvegicus (SEQ ID NO:50) 

[=1 > GI|2498392|SP|Q62632|FRP RAT FOLLI STATIN -RELATED PROTEIN 1 PRECURSOR 

f:=i Glll083669|PIR| IS51361 FOLLI STAT IN -RELATED PROTEIN PRECURSOR - RAT 

In GI| 536900 |GB|AAA66063 .ll (U06864) FOLLISTATIN- RELATED PROTEIN PRECURSOR [RATTUS 

30 NORVEGICUS] 

LENGTH =306 



□ SCORE = 86.4 BITS (213), EXPECT = lE-15 

IDENTITIES = 61/194 (31%), POSITIVES = 90/194 (45%), GAPS = 26/194 (13%)' 





QUERY : 


38 




SBJCT : 


29 


40 


QUERY: 


98 




SBJCT : 


88 


45 


QUERY : 


149 




SBJCT : 


144 




QUERY: 


198 


50 


SBJCT : 


203 



I II I I Ml 1 hi hi lllll-h I llhlll III I + 



ITVIHSKDCFLKGD TCTMAGYARLKNVLLA-LQTRLQPLQEGDSRQDPASQK 14 8 

II + I I II 1+ 1+ I I I 

I QVD YDGHCKE KKS VS PSAS P WCYQANRDELRRR 1 1 QWLEAE IIP DGWFS KGSNY 143 

RLLVESLFRDLDADGNGHLSSSELAQHVLK KQDLDEDLLGCSPGDLLRF 197 

1+ I +h II Ml + I ^ h -H. I I K 



DDYNSDSSLTLREF 211 
I hi h HI 



The amino acid sequence of the FCTR2 protein has 61 of 194 amino acid residues (31%) 
identical to, and 89 of 194 residues (45%) positive with, the 306 amino acid residue protein 
Follastin-Related Protein 1 Precursor from Mus musculus (GenBank Acc:Q62356) (SEQ ID 
55 N0:51) (Table 2H). 



15966-697 



Table 2H. BLASTP o^CTR2 against FoUastatin-Related P|^in 1 Precursor from Mus 
V mM5cw/tt5 (SEQ ID NO:51) W 

> GI I 667987llREFlNP 032073. l| FOLLISTATIN-LIKE [MUS MUSCULUS] 

Gl|249839llSP|Q62356|FRP MOUSE FOLLI STAT IN -RELATED PROTEIN 1 PRECURSOR (TGF-BETA- 
INDUCIBLE PROTEIN 
TSC-36) 

Gl|481186lPIRl IS38251 FOLLI STAT IN -RELATED PROTEIN - MOUSE 

GI I 349006 |GB| AAC37633 . 1 1 (M91380) TGF- BETA- INDUCIBLE PROTEIN [MUS MUSCULUS] 
LENGTH =306 

SCORE = 85.2 BITS (210), EXPECT = 3E-15 

IDENTITIES = 61/194 (31%), POSITIVES = 89/194 (45%), GAPS = 26/194 (13%) 

QUERY: 3 8 CGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKR 97 

I II I I I III I 1^1 hi llllhh I llhlll III I + 

SBJCT: 29 CANVFCGAGRECAVTEK-GEPTCLCIEQCKPHKRPVCGSNGKTYLNHCELHRDACLTGSK 87 

QUERY: 98 ITVIHSKDCFLKGDT CTMAGYARLKNVLLA-LQTRLQPLQEGDSRQDPASQK 148 

11+11 II 1+ k U + I I I 

SBJCT: 88 IQVDYDGHCKEKKSASPSASPWCYQANRDELRRRLIQWLEAEI IP DGWFSKGSNY 143 

QUERY: 149 RLLVESLFRDLDADGNGHLSSSELAQHVLKK QDLDEDLLGCSPGDLLRF 197 

+++ K I II III ^ I h ^+ I h 

SBJCT: 144 SEILDKYFKSFD-NGDSHLDSSEFLKFVEQNETAINITTYADQENNKLLRSLCVDALIEL 202 

QUERY: 198 DDYNSDSSLTLREF 211 

I hi h +11 
SBJCT: 203 SDENADWKLSFQEF 216 

The amino acid sequence of the FCTR2 protein has 63 of 193 amino acid residues (32%) 
identical to, and 89 of 193 residues (45%) positive with, the 299 amino acid residue protein 
Follastatin-Related Protein from the African Clawed Frog (GenBank Acc:JG0187) (SEQ ID 
NO:52) (Table 21). 

Table 21. BLASTP of FCTR2 against FoUastatin-Related Protein from the African Clawed 

Frog (SEQ ID NO:52) 

> GI|7512162|PIR| |JG0187 FOLLI STAT IN -RELATED PROTEIN - AFRICAN CLAWED FROG 
LENGTH = 299 

SCORE = 81.8 BITS (201), EXPECT = 3E-14 

IDENTITIES = 63/193 (32%), POSITIVES = 89/193 (45%), GAPS = 25/193 (12%) 



QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 



3 8 CGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKR 97 

I II I I ^-^ I hll IH h llllhh I llhlll III I ^ 

28 CANVFCGAGRECAVTEK-GDPTCDCIEKCKSHKRPVCGSNGKTYLNHCELHRDACLTGSK 86 



98 



87 



ITVIHSKDCFLK-GDT CTMAGYARL-KNVLLALQTRLQPLQEGDSRQDPASQK 148 

I I + I I II I ^ ^ k III + 1 I I 

IQVDYDGHCKEKTSDTPAAVPVACYQSDRDEMRRRVIHWLQTEITP DGWFSKGSDY 142 



149 RLLVESLFRDLDADGNGHLSSSELAQHVLKKQDL DED LLGCSPGDLLRFD 198 

+++ k Ilk II kll + I II- I h 

143 SEILDRYFKKFD-DGDSHLDSAELQSFLEQSQSTNITTYKDEETNRMLKSLCVEALIELS 201 

199 DYNSDSSLTLREF 211 
I kl I II 

202 DENADWKLNKNEF 214 
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The amino acid science of the FCTR2 protein has 59 o^^ amino acid residues (30%) 
identical to, and 90 of lURsidues (45%) positive with, the 308 »lmo acid residue protein 
FoUistatin-Related Protein 1 Precursor from Homo sapiens (GenBank Acc:Q 12841) (SEQ ID 
NO:53) (Table 2J). 

Table 2 J. BLAST? of FCTR2 against Follistatin-Related Protein 1 Precursor from Homo 

sapiens (SEQ ID NO:53) 

> GI|5901956|REF|NP 009016. ll FOLLISTATIN-LIKE 1 [HOMO SAPIENS] 
Gl|2498390lSP|O1284l|FRP HUMAN FOLLISTATIN-RELATED PROTEIN 1 PRECURSOR 
GI|1082372|PIR| IS51362 FOLLISTATIN-RELATED PROTEIN - HUMAN 

Gll536898lGB|AAA66062.ll {U06863) FOLLISTATIN-RELATED PROTEIN PRECURSOR [HOMO 
SAPIENS] 

GI |31843 93 |DBJlBAA28707.l| (D89937) FOLLISTATIN-RELATED PROTEIN (FRP) [HOMO SAPIENS] 
GI I 12652619|GB| AAH00055. llAAH00055 (BC000055) FOLLISTATIN-LIKE 1 [HOMO SAPIENS] 
LENGTH = 308 

SCORE = 82.9 BITS (204), EXPECT ^ lE-14 

IDENTITIES = 59/194 (30%), POSITIVES = 90/194 (45%), GAPS = 26/194 (13%) 



QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 



3 8 CGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKR 9 7 

I II I I MM I M M llllkh I IIIHII III I ^ 

31 CANVFCGAGRECAVTEK-GEPTCLCIEQCKPHKRPVCGSNGKTYLNHCELHRDACLTGSK 89 

98 ITVIHSKDCFLKGD TCTMAGYARLKNVLLA-LQTRLQPLQEGDSRQDPASQK 14 8 

II ^ I I I + K ^+ h ^ I I I 

9 0 IQVDYDGHCKEKKS VS PSAS P WCYQSNRDELRRRI IQWLEAE IIP DGWFS KGSNY 145 

149 RLLVESLFRDLDADGNGHLSSSELAQHVLKK QDLDEDLLGCSPGDLLRF 197 

+++ I III M + h +M I h 

146 SEILDKYFKNFD-NGDSRLDSSEFLKFVEQNETAINITTYPDQENNKLLRGLCVDALIEL 204 

198 DDYNSDSSLTLREF 211 

I l + l k -^11 
205 SDENADWKLSFQEF 218 



The amino acid sequence of the FCTR2 protein has 35 of 69 amino acid residues (50%) 
identical to, and 45 of 69 residues (64%) positive with, the 3 1 5 amino acid residue Flik protein 
[Gallus gallus] (EMBL Acc:CAB42968.1) (SEQ ID NO:54) (Table 2K). 

Table 2K. BLASTP of FCTR2 against Flik protein [Gallus gallus] (SEQ ID NO:54) 

> Gl|4837645|EMB|CAB42968.ll (AJ238977) FLIK PROTEIN [GALLUS GALLUS] 
LENGTH = 315 

SCORE = 79.8 BITS (196), EXPECT = lE-13 

IDENTITIES = 35/69 (50%), POSITIVES = 45/69 (64%), GAPS = 1/69 (1%) 
QUERY: 38 CGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKR 97 

I M Ik I I III I M IH IIIIMk I llhlll III I + 

SBJCT: 31 CANVFCGRGAECAVTEK-GEPTCLCIEQCKPHGRPVCGSNGKTYLNHCELHRDACLTGSK 89 

QUERY: 98 ITVIHSKDC 106 

M ^ I 
SBJCT: 90 IQVDYDGHC 98 



The amino acid sequence of the FCTR2 protein has 49 of 152 amino acid residues (32%) 
identical to, and 65 of 152 residues (42%) positive with a 272-420 amino acid fragment and, 31 
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of 83 residues (37%) idei^al to and 44 of 83 residues (52%) po^^mQ with a 248-329 amino 
acid fragment, both of tSRTS amino acid residue Frazzled gen^^tein [Drosophila 
melanogaster] (GenBankAcc:T13822) (SEQ ID NO:55) (Table 2L). 

Table 2L. BLASTP of FCTR2 against Frazzled gene protein [Drosophila melanogaster] 
5 (SEQ ID NO:55) 

> GI|7511861|PIR| IT13822 FRAZZLED GENE PROTEIN - FRUIT FLY (DROSOPHILA MELANOGASTER) 
GI I 1621115lGBlAAC47314 .l| (U71001) FRAZZLED [DROSOPHILA MELANOGASTER] 
LENGTH = 1375 



10 SCORE = 69.4 BITS (169), EXPECT = 2E-10 

IDENTITIES = 49/152 (32%), POSITIVES = 65/152 (42%), GAPS = 4/152 (2%) 





QUERY : 


243 


CAVHGDLRPPI I WKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGH- EQ 


301 


15 






1 H H 1 1 Ilk hi Ih II h 1 III 1 II ^ 




SB JCT : 


272 


CVANGVPKPQIKWLRNGMDLDFNDLDSRFSIVGTGSLQISSAEDIDSGNYQCRASNTVDS 


331 




QUERY : 


302 


LFQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQ 
1 HI II 1^ hi 1 1 1 1 Mill +^ t 


361 


20 


SB JCT : 


332 


LDAQATVQVQEPPKFIKAPKDTTAHEKDEPELKCDIWGKPKPVIRWLKNGDLITPNDYMQ 


391 




QUERY: 


362 


LSLLANGSELHISSVRYEDTGAYTCIAKNEVG 393 






SB JCT : 


392 


1 H 1 1 + 1 1 1 1 
LVDGHNLKILGLLNSDAGMFQCVGTNAAG 42 0 




25 


SCORE 


= 52. 


.9 BITS (126), EXPECT = lE-05 






IDENTITIES = 31/83 (37%), POSITIVES = 44/83 (52%), GAPS = 2/83 (2%) 






QUERY: 


311 


NVP P VI RVYPE S QAQE PGVAAS LRCHAEG I PMPR I TWLKNGVDVS - TQMS KQLS LLANGS 


369 


30 






HI 11^ 1 H 1 1 hi hi Ihlhh ^ + h^ II 




SB JCT : 


248 


SVAPSFLVGPSPKTVREGDTVTLDCVANGVPKPQIKWLRNGMDLDFNDLDSRFSIVGTGS 


307 




QUERY: 


370 


ELHISSVRYEDTGAYTCIAKNEV 3 92 

MM hi 1 1 1 1 1 

-LQISSAEDIDSGNYQCRASNTV 329 






SB JCT : 


308 





35 



The amino acid sequence of the FCTR2 protein has 53 of 177 amino acid residues (29%) 
identical to, and 78 of 177 residues (43%) positive with a 366-539 amino acid fragment, 51 of 
170 residues (30%) identical to and 74 of 170 residues (43%) positive with a 276-438 amino 
acid fragment, 46 of 165 amino acid residues (27%) identical to, and 74 of 165 amino acid 
40 residues positive with a 185-341 amino acid fragment, 48 of 167 amino acid residues (28%) 
identical to and 70 of 167 amino acid residues (41%) positive v^th a 77-243 amino acid 
fragment, and 28 of 84 amino acid residues (33%) and 37 of 84 amino acid residues positive with 
a 56-139 amino acid fragment all of the protein 1395 residue Roundabout 1 protein [Drosophila 
melanogaster] (GenBankAcc:AAC38849.1) (SEQ ID NO:56) (Table 2M). 

45 Table 2M. BLASTP of FCTR2 against Roundabout 1 protein [Drosophila melanogaster] 

(SEQ ID NO:56) 

> GI I 2804782 |GB|AAC38849.1| {AF040989) ROUNDABOUT 1 [DROSOPHILA MELANOGASTER] 
LENGTH = 1395 

50 SCORE =69.8 BITS (170), EXPECT = IE- 10 

IDENTITIES = 53/177 (29%), POSITIVES = 78/177 (43%), GAPS = 11/177 (6%) 
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10 



15 



QUERY: 


243 


SB JCT : 


366 


QUERY: 


297 


SB JCT : 


426 


QUERY: 


355 


SBJCT : 


486 


SCORE 


= 56 



CAVHGDLRPPj 
I h 

CMASGNPPPSVi 



vWtkei 



GL - TLN FLDLED IND F - GEDDS LY I TKVTT ^^lYTCHA 

h III ^ ^ I H II I Wl I I 

^TKEGVSTLMFPNSSHGRQYVAADGTLQITDVRQEDEGYYVCSAFSW 



- 296 



425 



-SGHEQLFQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDV 354 

I I ^ Ihh^ I +1 I hi I It I MM H I 



il I 



III 



I I 



■I ^1 



BITS (135), EXPECT = lE-06 
IDENTITIES = 51/170 (30%), POSITIVES = 74/170 (43%), GAPS = 12/170 (7%) 



20 



25 



v[| 30 



QUERY: 


243 


SBJCT : 


276 


QUERY: 


302 


SBJCT : 


333 


QUERY: 


359 


SBJCT : 


391 


SCORE 


= 51 



kl II I ++II+ 



I h II 



1+ H I I I I + I 



I I I I I I O I I III I 



I k MM I I I I ^ II 



•I++ I- 



IDENTITIES = 46/165 (27%) , POSITIVES = 74/165 (43%) , GAPS = 20/165 (12%) 











r-t: 




QUERY : 


251 


f=t 
is? 


35 


SBJCT : 


185 






QUERY: 


299 


fi% 












SBJCT: 


242 




40 


QUERY : 


359 






SBJCT: 


300 






SCORE 


= 44 



I +11 ++h k II- 



II I 



II + +1 h I I III I I 



I 1+ I I I I++ I I 



+1 + I lh+ I I I I I I II 



■II H- 



45 



IDENTITIES = 48/167 (28%), POSITIVES = 70/167 (41%), GAPS = 13/167 (7%) 



50 



55 



60 



65 



QUERY: 


243 


SBJCT : 


77 


QUERY: 


299 


SBJCT : 


137 


QUERY: 


355 


SBJCT : 


197 


SCORE 


= 42 



I I 



I I ++I 



I Ih I 



II h 



•I H-^ 



I I I I 



-III I 



-GNYTCHASG 298 
II II 



hll 



-LANGSELHISSVRYEDTGAYTCIAKNEVGVDE 3 96 
^ ^1 llhl I II llkl II I 



IDENTITIES = 28/84 (33%), POSITIVES = 37/84 (43%), GAPS = 4/84 (4%) 

QUERY: 314 PVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQLSLLANGSELH- 372 

I I ^1 + hi I II I M I hi III I + I 

SBJCT: 56 PRIIEHPTDLWKKNEPATLNCKVEGKPEPTIEWFKDGEPVSTNEKKSHRVQFKDGALFF 115 

QUERY: 373 ISSVRYEDTGAYTCIAKNEVG 3 93 

^ ^11 I hlllll 

SBJCT: 116 YRTMQGKKEQDGGEYWCVAKNRVG 139 
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The amin^cid sequence of the FCTR2 protein h^^ of 157 amino acid residues 
(35%) identical to, and 157 residues (47%) positive with a ^^775 amino acid fragment, 
49 of 163 residues (30%) identical to and 71 of 163 residues (43%) positive with a 335-492 
amino acid fragment, 32 of 85 amino acid residues (37%) identical to, and 48 of 85 amino acid 
residues (55%) positive with a 1305-1388 amino acid fragment, 37 of 143 amino acid residues 
(25%) identical to and 60 of 143 amino acid residues (41%) positive with a 183-319 amino acid 
fragment, 43 of 174 amino acid residues (24%) and 70 of 174 amino acid residues (39%) 
positive with a 71 1-884 amino acid fragment, and 46 of 165 residues (27%) identical to and 69 
of 165 residues positive with a 831-884 amino acid fragment all of the protein 1395 residue 
Dovm Syndrome Cell Adhesion Molecule Precursor (CHD2) from Homo Sapiens 
(GenBankAcc:O60469) (SEQ ID NO:57) (Table 2N). 

Table 2N. BLASTP of FCTR2 against Down Syndrome Cell Adhesion Molecule Precursor 

(SEQ ID NO:57) 

> qi 1 12643619 I sp|O60469|DSCA HUMAN DOWN SYNDROME CELL ADHESION MOLECULE PRECURSOR 
(CHD2) 

GI I 6740013 |GB| AAF27525. 1IAF217525 1 (AF217525) DOWN SYNDROME CELL ADHESION MOLECULE 
[HOMO SAPIENS] 

LENGTH = 2012 

SCORE = 70.6 BITS (172), EXPECT = 6E-11 

IDENTITIES = 55/157 (35%), POSITIVES = 75/157 (47%), GAPS = 7/157 (4%) 

VHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGHEQLFQ 3 04 

MM I k^^l + k-^ II 1^ ^1 Mill I 

VSGDLPITITWQKDGRPIPGSLGVTIDNIDFTSSLRISNLSLMHNGNYTCIARNEAAAVE 679 

THV-LQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITW-LKNGVDVST QM 3 58 

I I III I I I I I I I III IH I I II 

HQSQLIVRVPPKFWQPRDQDGI YGKAVILNCSAEGYPVPTI VWKFSKGAGVPQFQPIAL 739 

S KQLS LLANGS ELH I SS VR YE DTGAYTC I AKNE VG VD 3 95 

^ +UIII I I I IM I I hll I 
NGR I QVLSNGS - LL I KHWEEDSG Y YLCKVSNDVGAD 775 

.6 BITS (120), EXPECT = 7E-05 
IDENTITIES = 49/163 (30%), POSITIVES = 71/163 (43%), GAPS = 16/163 (9%) 

QUERY: 243 CAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGHEQL 3 02 

l+l I ^ MM II t I + + III 

SBJCT: 335 CSVTGTEDQELSWYRNGEILNPGKNVRITGINHEN-LIMDHMVKSDGGAYQCFVRKDKLS 393 

QUERY: 303 FQTH VLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITW LKNGV 352 

I + Ik I +1 O ^ I II h +II+I III III 
SBJCT: 394 AQDYVQWLEDGTPKIISAFSE-KWSPAEPVSLMCNVKGTPLPTITWTLDDDPILKGG- 451 

QUERY: 353 DVSTQMSKQLSLLAN-GSELHISSVRYEDTGAYTCIAKNEVGV 394 

I +^ I I hill - I II I I I II 

SBJCT: 4 52 - -SHRISQMITSEGNWSYLNISSSQVRDGGVYRCTANNSAGV 4 92 

SCORE = 47.9 BITS (113), EXPECT = 5E-04 
IDENTITIES = 32/85 (37%), POSITIVES = 48/85 (55%), GAPS = 6/85 (7%) 

QUERY: 333 LRCHAEGIPMPRITWLK- -NGVDVSTQMSKQLSLLANGSELHISSVRYEDTGAYTCIAKN 390 

II I I II O + l II + HII + I -h Ihl hill I 
SBJCT: 1305 LPCKAVGDPSPAVKWMKDSNGTPSLVTIDGRRSIFSNGSFI- IRTVKAEDSGYYSCIANN 1363 



QUERY: 


245 


SBJCT: 


620 


QUERY: 


305 


SBJCT: 


680 


QUERY: 


359 


SBJCT: 


740 


SCORE 


^ 50 
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QUERY: 391 EVGVDEDISSLFIE DSARKTLA 412 

I II I I I 

SBJCT: 1364 NWGSDEIIlJ^^3VPPDQPRLTVS 1388 



SCORE =42.9 BITS (100), EXPECT = 0.015 
IDENTITIES = 37/143 (25%), POSITIVES = 60/143 (41%), GAPS = 6/143 (4%) 

QUERY: 270 INDFGEDDSLYITKVTTIHMGNYTCHASGHEQLFQTHVLQVNVPPVIRVYPESQAQEPGV 329 

II ^111 ^ II I HI + III ^ ^ I 

SBJCT: 183 IKDVQNEDGLYNYRCITRHRYTGETRQSNSARLFVSD- -PANSAPSILDGFDHRKAMAGQ 240 

QUERY: 330 AASLRCHAEG I PMPRITWLKNGVDVSTQMSKQLS LLANOS ELHISSVRYEDTGAYTCIAK 389 

I I I I I I Ilk +^ + III hht I 

SBJCT: 241 RVELPCKALGHPEPDYRWLKD- -NMPLELSGRFQKTVTG- -LLIENIRPSDSGSYVCEVS 296 

QUERY: 390 NEVGVDEDISSLFIEDSARKTLA 412 

I I + I k + + + 
SBJCT: 2 97 NRYGTAKVIGRLYVKQPLKATIS 319 

SCORE =41.3 BITS (96), EXPECT = 0.047 

IDENTITIES = 43/174 (24%), POSITIVES = 70/174 (39%), GAPS = 11/174 (6%) 

VHGDLRPPIIWK- -RNGLTLNF- -LDLEDINDFGEDDSLYITKVTTIHMGNYTCHASG 298 

I I IHI + 1^1 ^ III I I I I I 

?^GYPVPTIVWKFSKGAGVPQFQPIAIiNGRIQVLSNGSLLIKHWEEDSGYYLCKVSN 770 

- EQLFQTHVLQVNVP P VI RVY PE SQAQE PC VAAS LRCHAEG I PMPRI TWLKNGVDVST 356 
+ I I H H II + I - I II ^ \ \ 



QUERY: 


243 


SBJCT : 


711 


QUERY: 


299 


SBJCT : 


771 


QUERY: 


357 


SBJCT : 


831 


SCORE 


= 40 



EMARYLVSTKEVGEEVISTLQILPTVREDSGFFSCHAINSYGEDRGIIQLTVQE 884 

40.6 BITS (94), EXPECT = 0.074 
IDENTITIES = 46/165 (27%), POSITIVES = 69/165 (40%), GAPS = 7/165 (4%) 

QUERY: 243 CAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTT- IHMGNYTCHASGHEQ 301 

III I I H II + I ^ H I ^ I Ilk I 

SBJCT: 525 CRVIGYPYYSIKWYKNSNLLPFNHRQVA- - FENNGTLKLSDVQKEVDEGEYTCNVLVQPQ 582 

QUERY: 302 LFQTHVLQVN- -VPPVIRVYPESQAQEPGVAASLRCHAEGIPMP-RITWLKNGVDVSTQM 358 

I + O III k + I I +1 H III M + + 

SBJCT: 583 LSTSQSVHVTVKVPPFIQPF-EFPRFSIGQRVFIPCVWSGDLPITITWQKDGRPIPGSL 641 

QUERY: 359 SKQLSLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFI 4 03 

+ + II lh+ I llllhll 111^ 

SBJCT: 642 GVTIDNIDFTSSLRISNLSLMHNGNYTCIARNEAAAVEHQSQLIV 686 

The amino acid sequence of the FCTR2 protein has 55 of 194 amino acid residues (28%) 
identical to, and 86 of 194 residues (44%) positive with Limbic System- Associated Membrane 
Protein Precursor (LSAMP) from Homo sapiens (SWISSPROT Acc:Q13449) (SEQ ID NO:58) 
(Table 20). 

Table 20. BLASTP of FCTR2 against Limbic System-Associated Membrane Protein 

Precursor (SEQ ID NO:58) 

PTNR:SWISSPROT-ACC:Q13449 LIMBIC SYSTEM-ASSOCIATED MEMBRANE PROTEIN PRECURSOR 
(LSAMP) - HOMO SAPIENS (HUMAN) , 338 AA. 

LENGTH = 338 

SCORE = 191 (67.2 BITS), EXPECT = 6.7E-12, P = 6.7E-12 
IDENTITIES = 55/194 (28%) , POSITIVES = 86/194 (44%) 
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The amino acid Upence of the FCTR2 protein has 68 oi^W amino acid residues (35%) 
identical to, and 92 of 190 residues (48%) positive with Putative Neuronal Cell Adhesion 
Molecule, Short Form from Mus musculus (SPTREMBL Acc:O70246) (SEQ ID NO:59) (Table 
5 2P). 

Table 2P. BLASTP of FCTR2 against Putative Neuronal Cell Adhesion Molecule, Short 

Form from Mus musculus (SEQ ID NO:59) 

PTNR:SPTREMBL-ACC:O70246 PUTATIVE NEURONAL CELL ADHESION MOLECULE (PUNC) 
(PUTATIVE NEURONAL CELL ADHESION MOLECULE, SHORT FORM) - MUS MUSCULUS 
10 (MOUSE) , 793 AA 

LENGTH = 793 



SCORE = 203 (71.5 BITS), EXPECT = 7.0E-12, SUM P{2) = 7.0E-12 
IDENTITIES = 68/190 (35%) , POSITIVES = 92/190 (48%) 



The amino acid sequence of the FCTR2 protein has 58 of 199 amino acid residues (29%) 
identical to, and 91 of 199 residues (45%) positive with CHLAMP, Gl 1-Isoform Precursor from 
Gallus gallus (SPTREMBL Acc: 002869) (SEQ ID NO:60) (Table 2Q). 

Table 2Q. BLASTP of FCTR2 against CHLAMP,Gll-Isoform Precursor from Gallus 
20 gallus (SEQ ID NO:60) 

PTNR:SPTREMBL-ACC:O02869 CHLAMP, Gll-ISOFORM PRECURSOR - GALLUS GALLUS 
(CHICKEN) , 350 AA. 

LENGTH = 350 

25 SCORE = 191 (67.2 BITS), EXPECT = 7.7E-12, P = 7.7E-12 

IDE^mTIES = 58/199 (29%) , POSITIVES - 91/199 (45%) 

The amino acid sequence of the FCTR2 protein has 55 of 194 amino acid residues (28%) 
identical to, and 86 of 194 residues (44%) positive with Limbic System- Associated Membrane 
30 Protein Precursor (LSAMP) from Rattus norvegicus (SWISSPROT Acc:Q62813) (SEQ ID 
N0:61) (Table 2R). 

Table 2R. BLASTP of FCTR2 against Limbic System-Associated Membrane Protein 
Precursor (LSAMP) from Rattus norvegicus (SEQ ID NO:61) 

PTNR:SWISSPROT-ACC:Q62813 LIMBIC SYSTEM-ASSOCIATED MEMBRANE PROTEIN PRECURSOR 
35 (LSAMP) - RATTUS NORVEGICUS (RAT) , 338 AA. 
LENGTH = 338 



SCORE = 188 (66.2 BITS), EXPECT = 1.5E-11, P = 1.5E-11 
IDENTITIES = 55/194 (28%), POSITIVES = 86/194 (44%) 



40 



FCTR2 protein has similarity to cell adhesion molecules, foUistatin, roundabout and 
frazzled (see BlastP results). These genes are involved in neuronal development and 
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reproductive physiologv^Frazzled encodes a Drosophila membej^^the DCC immunoglobulin 
subfamily and is requir^Br CNS and motor axon guidance (Ce^W: 197-204(1 996)). 
Characterization of a rat C6 glioma-secreted follistatin-related protein (FRP) and cloning and 
sequence of the himian homologue is described in Eur. J. Biochem. 225:937-946(1994). This 
protein may modulate the action of some growth factors on cell proliferation and differentiation. 
FRP binds heparin. The follistatin-related protein is a secreted protein and has one follistatin-like 
domain. The cloning and early dorsal axial expression of Flik, a chick follistatin-related gene and 
evidence for involvement in dorsalization/neural induction is presented in Dev. Biol. 178:327- 
342(1996). Roundabout controls axon crossing of the CNS midline and defines a novel 
subfamily of evolutionarily conserved guidance receptors, as shown in Cell 92:205-215(1998). 
cDNA cloning and structural analysis of the human limbic-system- associated membrane protein 
(LAMP) is described in Gene 170:189-195(1996). LAMP, a protein of the OBCAM family that 
contains three immunoglobulin-like C2-type domains, mediates selective neuronal growth and 
axon targeting. LAMP contributes to the guidance of developing axons and remodeling of 
mature circuits in the limbic system. This protein is essential for normal growdi of the 
hippocampal mossy fiber projection. LAMP is attached to the membrane by a GPI- Anchor. It is 
expressed on limbic neurons and fiber tracts as well as in single layers of the superior colliculus, 
spinal chord and cerbellum. Characterization of the human full-iength PTK7 cDNA encoding a 
receptor protein tyrosine kinase-like molecule closely related to chick KLG is disclosed in J. 
Biochem. 1 19:235-239(1996). Based upon homology, FCTR2 proteins and each homologous 
protein or peptide may share at least some activity. 

Functions and therapeutic uses: 
The OMIM gene map has identified this region which the invention maps to (5q21-5q31) 
as associated with susceptibility to the following diseases (OMIM Ids are underlined): 

• Allergy and asthma 

• Hemangioma, 

• capillary infantile Schistosoma mansoni infection, susceptibility/resistance to 
Spinocerebellar ataxia 

• Bronchial asthma 

• Plasmodium falciparum parasitemia, 

• intensity of Corneal dystrophy, Groenouw type I, 121900 ; Corneal dystrophy,lattice type 
L 122200 : 

• Reis-Bucklers comeal dystrophy;Comeal dystrophy, Avellino type Eosinophilia, familial 
Myelodysplastic syndrome; 
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• Myelogenous le^^mia. Acute Cutis laxa, recessive, ^ae I, Deafness, autosomal 
dominant nons^Bromic sensorineural, 1 Contractura^^rachnodactyly, Congenital 
Neonatal alloimmune thrombocytopenia; 

• Glycoprotein la deficiency Male infertility; 

5 • Charcot-Marie-Tooth neuropathy, Demyelinating Gardner syndrome ; 

• Adenomatous polyposis coli; 

• Colorectal cancer; 

• Desmoid disease, hereditary, 135290 ; 

• Turcot syndrome,276300; 

10 • Adenomatous polyposis coli, attenuated 

• Colorectal cancer 



Therefore the invention is implicated in at least all of the above mentioned diseases and 
Q may have therapeutic uses for these diseases. 

Jf'J 15 This sequence has similarity to cell adhesion molecules, foUistatin, roundabout and frazzled 

J:| (see BlastP results). These genes are involved in neuronal development and reproductive 

i:=i physiology. Therefore the invention is also implicated in disorders such as or therapeutic uses 

S for: 

• Neurodegenerative disorders, nerve trauma, epilepsy, mental health conditions 

Q 

20 • Tissue regeneration in vivo and in vitro 

n 

I'li Female reproductive system disorders and pregnancy 

FCTR3 

FCTR3, is an amino acid type II membrane, neurestin-like protein. The FCTR3a nucleic 
25 acid of 1430 nucleotides (also designated 10129612.0.1 18) is shown in Table 3A. An ORF was 
identified beginning with an ATG initiation codon at nucleotides 69-71 and ending with a TAG 
codon at nucleotides 1212-1214. A putative untranslated region upstream from the initiation 
codon and downstream from the termination codon is underlined in Table 3A, and the start and 
stop codons are in bold letters. 

30 Table 3A. FCTR3a Nucleotide Sequence (SEQ ID NO:5) 

AAAAAAGGCGGGGGGTGGACTTAGCAGTGTAATTTGAGACCGGTGGTAAGGATTGGAGCGAGCTAGAG ATGCTGCACGCTGCTAACA 
AGGGAAGGAAGCCTTCAGCTGAGGCAGGTCGTCCCATTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGCT 
CCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCATCAAATCATGGACACC 
AATTCTCCCCCi^TTCATACCTGCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACC^ 
35 CGACTCTGAGGCCCCCTCTCCCACCCCCTCACAACCACACGCTGTCCCATCACCACTCGTCCGCCAACTCCCTCAACAGGAACTCAC 
TGACCAATCGGCGGAGTCAGATCCACGCCCCGGCCCCAGCGCCCAATGACCTGGCCACCACACCAGAGTCCGTTCTVGCTTCAGGAC^ 
GCTGGGTGCTAAACAGCAACGTGCCACTGGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCT 
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CTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTTCT 

TCAAGCTGAAGAAGCCCTCa^ll^CTGCAGCTGGAAATGTGCTGCCCTCTCCGCCA^^CGCGGCCCTCCTCT^ 

TGGCGTATTTCATAGTGCCC'^BpGTTGAAAAACAGCAGCATAGACAGTGGTGAAG^^kGTTGGTC^ 

TCCCACCAGGGGTGTTTTGGAGGrbvCAAATTCy^CATCAGTCAGCCCCAGTTCTTAAAG 
5 TCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCATGCCCAGTATGACTTCATGGAACGTCT^ 

GTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCC 
TGTGGCATCTGGCCTTCTACAATGATGGAAAAGACAAAGAGATGGTTTCCTTCAATACTGT^ 
AGAAAAACAAGCTCAGGGCGCCCACTGATTTGACATTATGATTCAGTGCAGGACTGTCCACGTAACTGCCATG^ 
GTGTCCGGGGTGTGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAGACCTTCCTGCCTTGACTTTCTGCAAGACAATC 
10 ATTAATAAAGCTGCTCTGTAAATACTAAAAAAAAAACA 



The FCTR3 polypeptide (SEQ ID N0:5) encoded by SEQ ID N0:5 is 381 amino acid 
residues and is presented using the one-letter code in Table 3B. 

Table 3B. Encoded FCTR3a protein sequence (SEQ ID NO:6). 

15 MLHAANKGRKPSAEAGRPIPPTSSPSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGP 
PNHHSQSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESV 

TPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWKCAALSAIAAALLLAILLAYFIVPWSLKNSSID^ 
RRVTQEVPPGVFWRSQIHISQPQFLKFNISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSVVESPRERRSIQTLVQNEAVFV 
QYLDVGL WHLAFYNDGKDKEMVS FNTWLDGT I 

20 

In an alternative embodiment, the 5' end of the FCTR3a nucleic acid could be extended 
^ as it is in the 9826bp FCTR3b (also referred to herein as 10129612.0.405) shown in Table 3C. 

Cd An ORF was identified beginning with an ATG initiation codon at nucleotides 280-282 and 

ending with a TAA codon at nucleotides 8479-848 1 . A putative untranslated region upstream 

-^"^ 25 from the initiation codon and downstream from the termination codon is underlined in Table 3C, 

f - 

It) and the start and stop codons are in bold letters. Italicized bases 1-201 refer to a variable 5' 

p-, region that will be further discussed below. 

Table 3C. FCTR3b Nucleotide Sequence (SEQ ID NO:7) 

[fl TTTAAATCCTCATACCTTAAAGGAGATGTGTATATAAGGGAGTTGGAACCAGCATTAGATGAGTTGACAAAAATGCAGTT 
f=| 30 TCAGTTCTAGAGGTCTGGGAAGTCCAAGAACAAGGTGCTGGCAGATTGGATTCCCCGTGAGGGCTTTCTTCCTGGCTTGA 
ri AGTTGGCTGCTTTCCTGCTGAGACTTCTCATGGCAGAGACTGAGGGTGGCAJ^GTGACAAGTGCC^ 

CTTTTCTGAAAACATCAGCATTCTGCCATATCTGGAATA ATGGATGTAAAGGACCGGCGACACCGCTCTTTGACCAGAGG 
ACGCTGTGGCAAAGAGTGTCGCTACACAAGCTCCTCTCTGGACAGTGAGGACTGCCGGGTGCCCACACAGAAATCCTACA 
GCTCCAGTGAGACTCTGAAGGCCTATGACCATGACAGCAGGATGCACTATGGAAACCGAGTCACAGACCTCATCCACCGG 

35 GAGTCAGATGAGTTTCCTAGACAAGGAACCAACTTCACCCTTGCCGAACTGGGCATCTGTGAGCCCTCCCCACACCGAAG 
CGGCTACTGCTCCGACATGGGGATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTCCGACACCGAGG 
GAGGGATGTCTCCAGAACACGCCATCAGACTGTGGGGCAGAGGGATAAAATCCAGGCGCAGTTCCGGCCTGTCCAGTCGT 
GAAAACTCGGCCCTTACCCTGACTGACTCTGACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCATTCCACCTAC 
ATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGC 

40 TAGACAGCAACACCTCCCATCAAATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCA^^ 

TGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACCACAGCCAGTCGACTCTGAGGCCCCCTCTCCCACC 
CCCTCACT^CCACACGCTGTCCCATCACCACTCGTCCGCCAACTCCCTCTy^CAGGAACTCACTG^^ 
AGATCCACGCCCCGGCCCCAGCGCCCAATGACCTGGCaVCCACACCAGAGTCCGTTCAGCTTCAGGACAGCTGGGTGC 
AACAGCAACGTGCCACTGGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCTCTTC 

45 CCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAGG 
CTTTCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGCCCTCTCCGCCATTGCCGCGGCCCTCCTCTTG 
GCTATTTTGCTGGCGTATTTCATAGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCG 
GCGGGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCCCCAGTTCTTAAAGTTCA 
ACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCATGCCCAGTATGACTTC 

50 ATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCA 
GAATGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGATGGAAAAGACAAAGA 
TGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAATGGTG/^TC 
GGGGTGTGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCAGTGG^ 
ACAATATTCTAAAGGGACGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCCCATGAAT^ 

55 ATCCTTCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTGTGTCTGCTCTGCTGGCTACAAAGGCGAGCACTGTGAG 
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GAAGTTGATTGCTTGGATCCCACCTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAAT^CTGTG^^ 

TGGTCTGAACTGTGAGCTGGG^kCTCCAGTGCCCAGACCAGTGCAGTGGGCATGC^^m 

TCTGCAGCTGCGATCCCAACl^^GGGTCCCGACTGCTCTGTTGAAGTGTGCTCAGl^fcTC 

TGCATCGGGGGAGCCTGCCGCTGTGAAGAGGGCTGGACAGGCGCAGCGTGTGACCAGCGCGTGTGCCACCCCCGC^ 
TGAGCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTGC^^ 
CGGCAGGCACCGAAACAGATGGCTGCCCTGACTTGTGCAACGGTAACGGGAGATGCACACTGGGTC^^ 
TGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACTTCCTG^ 

GGGAGATGGCCTGGTGGATTGTTTGGACCCTGACTGCTGCCTGCAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGT 

CCCGGGACCCACTGGACATCATTCAGCAGGGCCAGACGGATTGGCCCGCAGTGAAGTCCTTCTATGACCGTATCAAGCTC 

TTGGCAGGCAAGGATAGCACCCACATCATTCCTGGAGAGAACCCTTTCAACAGCAGCTTGGTTTC^ 

AGTAGTAACTACT^GATGGAACTCCCCTGGTCGGTGTGAACGTGTCTTTTGTCAAGTACCCAAAATACGGCTACACCATC^ 

CCCGCCAGGATGGCACGTTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAGCGAGCCCCGTTCATG 

AGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCCATGGACACCCTGGTGATGAAGACCGAGGAGAACTC 

CATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCTGATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTG 

CTGCCCCTGGGCAGAATCCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCTGGTTCCAATG^^ 

CTTCX5CTATCTGAGCTCTAGAACTGCAGGGTACAAGTC7VCTGCTGAAGATCACCATGACCCAGTCCACAGTGCC^ 

CCTCATTAGGGTTCACCTGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTCTCCC7UVCCTC 

CCACCTTCATCTGGGACAAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGTGTCTGTCGGGTTT 

GAATATGAGACCTGTCCCAGTCTAATTCTCTGGGAGAAAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAA 

CCTCGGTGGCTGGTCCCTAGACAAACACCACATCCTCAATGTTAAAAGTGGAATCCTACACAAAGGCACTG^ 

AGTTCCTGACCCAGCAGCCTGCCATCTVTCACCAGCATCATGGGCAATGGTCGCCGCCGGAGCATTTCCTGTCCC^^ 

AACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGT^ 

CTTCAATTACATCCGACGCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGAAATAAAGAGTTTAAAC^^ 

GCAACAACCCAGCACACAAGTACTACTTGGCAGTGGACCCGGTGTCCGGCTCGCTCTACGTGTCCGA^ 

AGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAGACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCA 

GTGTCTACCCTTTGATGAAGCCCGCTGCGGGGATGGAGGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTATTG 

CAGTAGACAAGAATGGGCTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCC^^ 

CTGCTGGGCTCCAATGACCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGA 

GTGGCCAACAGACCTTGCTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATCACCG 

AGAACCy^CCAAGTCAGCATCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTGACTAC 

GCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATG 

GAAGAAGATTAACCGTCTACGCCAGGTAACAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCGACT 

GCAAAAACGATGTCAATTGCAACTGCTATTCAGGAGATGATGCCTACGCGACTGATGCCATCTTGAATTCCCCATCATCC 

TTAGCTGTAGCTCCAGATGGTACCATTTACATTGCAGACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAAC^ 

TGTTCTTAATGCCTTCAACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCATCC 

ACCAATACACTGTGAGCCTGGTGACAGGGGAGTACTTGTACAATTTCACATATAGTACTGACAATGATGTCACTGAATTG 

ATTGACAATAATGGGAATTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGACAACCA 

GATCATCACCCTCACCGTGGGCACCAATGGAGGCCTCAAAGTCGTGTCCACACAGAACCTGGAGCTTGGTCTCATGACCT 

ATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTTTCTATGACTATGACCACGAA 

CGCCTGACCAACGTGACGCGCCCCACGGGGGTGGTAACCAGTCTGCACCGGGAAATGGAGAAATCTATTACCATTGACAT 

TGAGAACTCC7VACCGTGATGATGACGTCACTGTCATCACCAACCTCTCTTCAGTAGAGGCCTCCTACACAGTGGTACAAG 

ATCAAGTTCGGAACAGCTACCAGCTCTGTAATAATGGTACCCTGAGGGTGATGTATGCTAATGGGATGGGTATCAGCTTC 

CACAGCGAGCCCCATGTCCTAGCGGGCACCATCACCCCCACCATTGGACGCTGCAACATCTCCCTGCCTATG^ 

CTTAAACTCCATTGAGTGGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCCGGGTCC 

ATGGAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGACTGAAAAGATCTATGATGACCACCGGAAGTTCACC 

CTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCCTCTGGCTGCCCAGCAGCGGGCTGGCAGCTGTCAACGTGTCATA 

CTTCTTCAATGGGCGCCTGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAGACATCGACAAGCAAGGCCGCATCG 

TGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCTTGACAAGTCCATGGTCCTCCTGCTTCAGAGCCAA 

CGTCAGTATATATTTGAGTATGACTCCTCTGACCGCCTCCTTGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTC 

CACACACACCTCCATCGGCTACATCCGTAATATTTACAACCCGCCTGAAAGCAATGCTTCGGTCATCTTTGACTACAGTG 

ATGACGGCCGCATCCTGAAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTACAAGTATGGGAAACTCTCCAAGTTA 

TCAGAGATTGTCTACGACAGTACCGCCGTCACCTTCGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCA 

AAGTGGGGGCTTCTCCTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCCGAGG 

AAGGCATGGTCAATGCCAGGTTTGACTACACCTATCATGACAACAGCTTCCGCATCGCAAGCATCAAGCCCGTCATAAGT 

GAGACTCCCCTCCCCGTTGACCTCTACCGCTATGATGAGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCAT 

CTATTATGACATCAACCAGATCATCACCACTGCCGTGATGACCCTCAGCAAACACTTCGACACCCATGGGCGGATCAAGG 

AGGTCCAGTATGAGATGTTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACAGCATGGGCAGGGTGATCT^ 

GAGCTAAAACTGGGGCCCTATGCCAATACCACGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGC 

CGTCAATGACCGCCCGACCTGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTGTGC 

GCCTCATGCCCTTGCGCTATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGATGGC 

TATCTGTGCCAGAGAGGGTCTGACATCTTCGAATACAATTCCAAGGGCCTCCTAACAAGAGCCTACAACAAGGCCAGCGG 

GTGGAGTGTCCAGTACCGCTATGATGGCGTAGGACGGCGGGCTTCCTACAAGACCAACCTGGGCCACCACCTGCAGTACT 

TCTACTCTGACCTCCACAACCCGACGCGCATCACCCATGTCTACAATCACTCCAACTCGGAGATTACCTCACTGTACTAC 

GACCTCCAGGGCCACCTCTTTGCCATGGAGAGCAGCAGTGGGGAGGAGTACTATGTTGCCTCTGATAACACAGGGACTCC 

TCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCCTATGGGGAGATTTATTATC 

ACCCCGACTTCCAGATGGTCATTGGCTTCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAGCGT 

GATTATGATGTGCTGGCAGGACGATGGACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCC 

TAACCTGTATATGTTCAAGAGCAACAATCCTCTCAGCAGTGAGCTAGATTTGAAGAACTACGTGACAGATGTGAAAAGCT 

GGCTTGTGATGTTTGGATTTCAGCTTAGCAACATCATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCC 

TATGAATTGTCAGAGAGTCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAACAGACAACAGAGAGACATAACCA 
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GGCCTTCATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATCCGAGAGAAAGC^ 
CCACCACCT^CGCCCATCATTC^I^GGCATCATGTTTGCCATCAAAGAAGGGCG^ 

gccagcgaagatagccgcaag^hcatctgtgctgaacaacgcctactacctggac^^rrgca 

caaggacacccactactttgt35Jgattggctcagccgatggcgacctggtcacactaggca 

TGCTAGAGAGCGGGGTGAACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAAGGTTCACGAAC^ 

gagttccagtactccacgctgctgctcagcatccgctatggcctcacccccgacaccctggacgaagagaaggcccgcgt 

CCTGGACCAGGCGAGACAGAGGGCCCTGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGA^^ 
GCCGCCTGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTACGAGGGATATT^ 

cccgtggagcaatacccagagcttgcagacagtagcagcaact^tccagtttttaagacagaatgagatggga;^ 

acaaaataatctgctgccattccttgtctgaatggctcagcaggagtaactgttatctcctctcctaaggagatgaagac 

ctaacaggggcactgcggctgggctgctttaggagaccaagtggcaagaaagctcacattttttgagttc^^ 

gtccaagcgagaagtccctcatcctgaagtagactaaagcccggctgaaaattccgaggaaaacaaaacaaacgaatgaa 

tgaacagacact^cacaatgttccaagttcccctaaaatatgacccacttgttergggtctac 

gtgtccaaaaggaacaaaagaacaaaaacgaataagcaaagaagaaaacaaacaaaaac^^ 

ccgataaacaaagaagcgaagataagaaagaaggcctcatatccaattacctcactcattcacatgtgagcgac^ 

ACATCCGCGAGGGCCAGCGTCACCAGACCAGCTGCGGGACAAACCTVCTCAGACTGCTTGTAGGACA^ 

TTTCGTTTAAGCAAATACAGGTGCATTTAAAACaCGACTTTGGGGGTGATTTGTGTGTAGCGCCTGGGGAGGGGGG^ 

AAGAGGAGGAGTGAGCACTGGAAATACTTTTTAAAGAAAAAAAAACATGAGGGAATAAAAGAAATTCCTATCAAAAATC^ 

aagtgaaataataccatccagcacttaactctcaggtcccaactaagtctggcctgagctaattt^ 
gtaaaatttaattcaaaatggtggctataatcactacagataaatttcatactcttttgtctttggagattccattgtgg 

ACAGTAATACGCAGTTACAGGGTGTAGTCTGTTTAGATTCCGTAGTTCGTGGGTATCAGTTTCGGTAGAGGTGCAGCATC 

GTGACACTTTTGCTAACAGGTACCACTTCTGATCACCCTGTACATACATGAGCCGAAAGGCACAATCACTGTT^ 

TAAAATTATTAGTGTGTTTGTTTGGTCCAGAAACTGAGACAATCACATGACAGTCACCACGAGGAGAGAAAATTTAAAAA 

ATAAAAATAAAAACTVAAAAAAATTTTAAAAATTAAAAAAACAAAAATAAAGTCTAATAAGAACTTTGGT^ 

TTTGTAATATACATGTATGAATTGTTCATCGAGTTTTTATATTAATTTTAATTTGCTGCTAAGCAAAGACTAGGGACAGG 

caaagataatttatggcaaagtgtttaaattgtttatacataaataaagtctctaaaactcctgtg 



The FCTR3b polypeptide (SEQ ID N0:8) encoded by SEQ ID N0:7 is 2733 amino acid 
residues and is presented using the one-letter code in Table 3D. The protein has a predicted 
molecular weight of 303424.3 daltons. 

Table 3D. Encoded FCTR3b protein sequence (SEQ ID NO:8). 

mdvkdrrhrsltrgrcgkecrytsssldsedcrvptqksysssetlkaydhdsr^raygnrvtdlihresdefprqgtnftij^e^^ 
cepsphrsgycsdmgilhqgyslstgsdadsdteggmspehairlwgrgiksrrssglssrensaltltdsdnenksddengrpip 
ptsspsllpsaqlpsshnpppvscqmplldsntshqimdtnpdeefspnsyllracsgpqqasssgppnhhsqstlrpplppphnh 
tlshhhssanslnrnsltnrrsqihapapapndlattpesvqlqdswlnsnvpletrhflfktssgstplfsssspgyp 

YTPPPRLLPRNTFSRKAFKIjKKPSKYCSWKCAALSAIAAALLLAILLAYFIVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQI 

HISQPQFLKFNISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSVVESPRERRSIQTLVQNEAVFVQYLDVGLWHLAFYNDG 

KDKEMVSFNTVVLDSVQDCPRNCHGNGECVSGVCHCFPGFLGADCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDV 

SCGGHGSCIDGNCVCSAGYKGEHCEEVDCLDPTCSSHGVCWGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDTGLCS^^ 

MGPDCSVEVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHCTIGRQTAGTETDGCPDLCN 

GNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDIIQQGQTDWPAVKS 

FYDRIKLLAGKDSTHIIPGENPFNSSLVSLIRGQVVTTDGTPLVGVNVSFVKYPKYGYTITRQDGTFDLIANGGASLTIJIFE^ 

msqertwlpvmsfyamdtlvmkteensipscdlsgftopdpiiissplstffsaapgqnpivpetqvlheeielpgsnvklryls 

srtagyksllkitmtqstvplnlirvhl^waveghlfqksfqaspnij^tfiwdktdaygqrvyglsdawsvgfeyetcpslilw 

ekrtallqgfeldpsni/xswsldkhhilnvksgilhkgtgenqfltqqpaiitsimgngrrrsiscpscngij^ 

gidgslyvgdfnyirrifpsrnvtsilelrnkefkhsnnpahkyylavdpvsgslyvsdtnsrriyrvkslsgtkdia 

tgeqclpfdearcgdggkaidatlmsprgiavdknglmyfvdatmirkvdqngiistlixssndltavrplscdssmdva^ 

tdlawpmdnslyvlennvilritenhqvsiiagrpmhcqvpgidyslsklaihsalesasaiaishtgvl^ 

vttngeiclijvgaasdcdckndvncncysgddayatdailnspssi^vapdgtiyiadlgniriravsknkpvi^ 

eqelyvfnadgihqytvslvtgeylynftystdndvtelidragnslkirrdssgmprhllmpdnqiitltvgtngglk^ 

eu;lmtydgntgliatksdetgwttfydydhegrltnvtrptgvvtslhremeksitidiensnrdddvtvito 

dqvrnsyqlcnngtlrvmyangmgisfhsephvijvgtitptigrcnislpmengij^siewrlrkeqikgk^ 

SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSG1J\AVNVSYFFNGRIAGLQRGAMSERTDIDKQGRIVSRMFAIX5^ 

ysyldksmvlllqsqrqyifeydssdrllawmpsvarhsmsthtsigyirniynppesnasvifdysdixsrilktsfi^tgrqvf 

ykygklsklseivydstavtfgydettgvlkmtolqsggfsctiryrkigplvdkqiyrfseeg^wnarfdytyhdns 

visetplpvdlyrydeisgkvehfgkfgviyydinqiittavmtlskhfdthgrikevqyemfrslmywmtvqydsmgrvikrelk 

ix3pyanttkytydydgix3qlqsvavndrptwrysydlngnlhlu^pgnsvriimplrydlrdritrlgdvqy 

feynskglltraynkasgwsvqyrydgvgrrasyktni/3hhlqyfysdlhnptrithvynhsnseitslyydlqghlfame 

EYYVASDNTGTPIAVFSINGLMIKQLQYTAYGEIYYDSNPDFQ^1VIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYT^^ 

KEPAPFNLYMFKSmPLSSEU)LKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENGQ 

AFMALEGQVITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGRVTTGVSSIASEDSRKVASVI^ 

VKIGSATCDLVTLGTTIGRKVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARV^ 

WAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 
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5' end of the FCTR3b 



:ed for in other 



embodiments of FCTR3. The nucleotide sequence for 9823bp FCTR3c (also referred to herein as 
10129612.0.154) has the same nucleotide sequence as FCTR3b except that the italicized region 
5 is replaced with the 201 base sequence shown in Table 3E. An ORF for the total FCTRSc 

nucleotide sequence was identified beginning with an ATG initiation codon at nucleotides 277- 
280 and ending with a TAG codon at nucleotides 8473-8475. This is the same open reading 
frame that is shown in Table 3C, with the corresponding base numbers for FCTR3c. This open 
reading frame will translate the same amino acid sequence as shown in Table 3C for FCTR3b. 



referred to herein as 10129612.0.67). An ORF was identified beginning with an ATG initiation 
codon at nucleotides 277-280 and ending with a TAG codon at nucleotides 8473-8475. This is 
20 the same open reading frame that is shown in Table 3C, with the corresponding base numbers for 
FCTR3d. This open reading frame will translate the same amino acid sequence as shown in 
Table 3DforFCTR3b. 

Table 3F. Encoded FCTR3d 5'end nucleotide sequence (SEQ ID NO:10). 

GCTCCAAAGCGAGCTGGGACCGAAGACTCTAGGCTAAGTTATCTATGTAGATGGTGTCAGGGAGCGAAGCTACTGACCGA 
25 GCTGCTGTTACATCCAGCTTTTTAATTGCCTAAGCGGTCTGGGGCTTGCTTCGTCATTTGGCTTTGCTGTGGAGCACTCC 
TGTAAAGCCAGCTGAATTGTACATCGAAGATCCACCCTTTT 

In yet another embodiment, the italicized region shown in the 5' end of the sequence in 
Table 3C can be replaced with the sequence shown in Table 3G to form 9765 bp FCTR3e (also 
30 referred to as 10129612.0.258). An ORF was identified beginning with an ATG initiation codon 
at nucleotides 210-212 and ending with a TAG codon at nucleotides 8408-8410. This is the same 
open reading frame that is shown in Table 3C, with the corresponding base numbers for 
FCTR3e. This open reading frame will translate the same amino acid sequence as shown in 



10 



Table 3E. Encoded FCTR3c 5'end nucleotide sequence (SEQ ID NO:9). 



GCTCCAAAGCGAGCTGGGACCGAAGACTCTAGGCTAAGTTATCTATGTAGATGGTGTCAGGGAGCGAAGCTACTGACCGA 
GCTGCTGTTACATCCAGCTTTTTAATTGCCTAAGCGGTCTGGGGCTTGCTTCGTCATTTGGCTTTGCTGTGGAGCACTCC 
TGTAAAGCCAGCTGAATTGTACATCGAAGATCCACCCTTTT 



15 



In yet another embodiment, the italicized region shown in the 5' end of the sequence in 
Table 3C can be replaced with the sequence shown in Table 3F to form 9823bp FCTR3d (also 



Table 3D for FCTR3b. 
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Table 3G. Encoded FCTR3e 5'end nucleotide sequence (SEQ ID NO:ll). 



CCAGCATTAGATGAGTTGACAAAAATGCAGTTTCAGCTCTGAAGGTCTGAAAGATTCTGCTGCAACTAAAGCTCTGAAGA 
TTCTGCTACAACTATGACATCCATTTTCTCCCACTTCAGACAGGATGAATACAA 
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f (also referred to as 



10129612.0.352) was fdW having the 9729bp sequence shown liable 3H. An ORF was 
identified beginning with an ATG initiation codon at nucleotides 210-212 and ending with a 
TAG codon at nucleotides 8382-8384. A putative untranslated region upstream from the 
initiation codon and downstream from the termination codon is underlined in Table 30, and the 
start and stop codons are in bold letters. 



CCAGCATTAGATGAGTTGACAAAAATGCAGTTTCAGCTCTGAAGGTCTGAAAGATTCTGCTGCAACTAAAGCTCTGAAGA 

TTCTGCTACAACTATGACATCCT^TTTTCTCCCACTTCAGACAGGATGAATACAAGGTGGCAAAGTGAC^ 

TCAGGCCTGACTTTCCTGAAAACATCAGCATTCTGCCATATCTGGAATAA TGGATGTAAAGGACCGGCGACACCGCTCTT 

TGACCAGAGGACGCTGTGGCAAAGAGTGTCGCTACACAAGCTCCTCTCTGGACAGTGAGGACTGCCGGGTGCCCAC^ 

AAATCCTACAGCTCCAGTGAGACTCTGAAGGCCTATGACCATGACAGCAGGATGCACTATGGAAACCGAGTCACAGACCT 

CATCCACCGGGAGTCAGATGAGTTTCCTAGACAAGGAACCAACTTCACCCTTGCCGAACTGGGCATCTGTGAGCCCTCCC 

CACACCGAAGCGGCTACTGCTCCGACATGGGGATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTCC 

GACACCGAGGGAGGGATGTCTCCAGAACACGCCATCAGACTGTGGGGCAGAGGGATAAAATCCAGGCGCAGTTCCG 

GTCCAGTCGTGAAAACTCGGCCCTTACCCTGACTGACTCTGACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCA 

TTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCAG 

ATGCCATTGCTAGACAGCAACACCTCCCATCAAATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTC^ 

GCTCAGAGCATGCTCT^GGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACCACAGCCAGTCGACTCTGAGGC 

CTCTCCCACCCCCTCACAACCACACGCTGTCCCATCACCACTCGTCCGCCAACTCCCTCAACAGGAACT 

CGGCGGAGTCAGATCCACGCCCCGGCCCCAGCGCCCAATGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAG^ 

CTGGGTGCTAAACAGCAACGTGCCACTGGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCT 

GCAGCTCTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTTC 

TCCAGGAAGGCTTTCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGCCCTCTCCGpCATTGCCGCGGC 

CCTCCTCTTGGCTATTTTGCTGGCGTATTTCATAGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAG 

AAGTTGGTCGGCGGGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCCC^ 

TTAAAGTTC^CATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCATGCCCA 

GTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGA 

CCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGATGGAAAA 

GACAAAGAGATGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAATGGTGA 

ATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCA 

GTGGGAATGGACAATATTCTAAAGGGACGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCCCATGAAT 

CAGTGCATCGATCCTTCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTGTGTCTGCTCTGCTGGCTACAAAGGCGA 

GCACTGTGAGGAAGTTGATTGCTTGGATCCCACCTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAATGCCTGTGCAGCC 

CTGGCTGGGGTGGTCTGAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTACCT^ 

GACACGGGCCTCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGTTGAAGTGTGCTCAGTAGACTGTGGCAC 

TCACGGCGTCTGCATCGGGGGAGCCTGCCGCTGTGAAGAGGGCTGGACAGGCGCAGCGTGTGACCAGCGCGTGTGCCACC 

CCCGCTGCATTGAGCATGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAAC^ 

GATGGCTGCCCTGACTTGTGCAACGGTAACGGGAGATGCACACTGGGTCAGAACAGCTGGCAGTGTGTCTGCCAGACCGG 

CTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACTTCCTGTGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGG 

ATTGTTTGGACCCTGACTGCTGCCTGCAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGTCCCGGGACCCACTGGAC 

ATCATTCAGCAGGGCCAGACGGATTGGCCCGCT^GTGAAGTCCTTCTATGACCGTATCAAGCTCTTGGCAGGCT^ 

CACCCACATCATTCCTGGAGAGAACCCTTTCAACAGCAGCTTGGTTTCTCTCATCCGAGGCCAAGTAGTAACTACAGATG 

GAACTCCCCTGGTCGGTGTGAACGTGTCTTTTGTCT^GTACCCAAAATACGGCTACT^CCATCACCCGCCAGGATGGC^ 

TTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAGCGAGCCCCGTTCATGAGCCAGGAGCGCACTGT 

GTGGCTGCCGTGGAACAGCTTTTACGCCATGGACACCCTGGTGATGAAGACCGAGGAGAACTCCATCCCCAGCTGTGACC 

TCT^GTGGCTTTGTCCGGCCTGATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTGCTGCCCCTGGGCAGAAT 

CCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCTGGTTCCAATGTGAAACTTCGCTATCTGAGCTC 

TAGAACTGCAGGGTACAAGTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTGAACCTCATTAG^ 

TGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCTCCACCTTCATCTGGGAC 

AAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGTGTCTGTCGGGTTTGAATATGAGACCTGTCC 

CAGTCTAATTCTCTGGGAGAAAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCC 

TAGACAAACACCACATCCTCAATGTTAAAAGTGGAATCCTACACAAAGGCACTGGGGAAAACCAGTTCCTGACCCAGCAG 

CCTGCCATCATCACCAGCATCATGGGCAATGGTCGCCGCCGGAGCATTTCCTGTCCCTVGCTGCAACGGCCTTGCTGAAGG 

CT^CAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGACTTCAATTACA 

GCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAACCCAGCACAC 

AAGTACTACTTGGCAGTGGACCCCGTGTCCGGCTOjCTCTACGTGTCCGACACCAACAGCAGGAGAATCTACCGCGTCAA 

GTCTCTGAGTGGAACCT^GACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCTAC 

AAGCCOSCTGCGGGGATGGAGGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTATTGCAGTAGACAAGA^ 

CTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAATC 

CCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGAGTGGCCAACAGACCT^ 

CTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATCACCGAGAACCACCAAGTCAGC 



Table 3H. Encoded FCTR3f nucleotide sequence (SEQ ID NO: 12). 
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ATCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTGACTACTCACTCAGCA^TAGCCATTCACTCTC 

GGAGTa^GCCAGTGCCATTGO^fcrCTCACACTGGGGTCCTCTACATCACTGAGAq^^^ 

TACGCCAGGTAACAACCAAcd^^GATCTGCCTTTTAGCTGGGGCAGCCTCGGACTl^fcTGCT^^ 

TGCAACTGCTATTCAGGAGATGATGCCTACGCGACTGATGCCATCTTGAATTCCCCATCATCCTTAGCTGTAGCT^ 

TGGTACCATTTACATTGCAGACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAACAAGCCTGTTCTTAATC 

ACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCT^CGCTGATGGCATCCACCAATAC^^ 

CTGGTGACAGGGGAGTACTTGTACAATTTCACTVTATAGTACTGACAATGATGTCACTGAATTGAT^ 

TTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGACT^CCAGATCATCACCCTCA 

TGGGCACCAATGGAGGCCTCAAAGTCGTGTCCACACAGAACCTGGAGCTTGGTCTCATGACCTATGATGGCAAC^ 

CTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTTTCTATGACTATGACCACGAAGGCCGCCTGACCAACGTGAC 

gcgccccacgggggtggtaaccagtctgcaccgggaaatggagaaatctattaccattgacattgagaactccaaccgtg 

atgatgacgtcactgtcatcaccaacctctcttct^gtagaggcctcctacacagtggtacaagatcaagttcggaa 

taccagctctgtaataatggtaccctgagggtgatgtatgctaatgggatgggtatcagcttccacagcgagccccatgt 

cctagcgggcaccatcacccccaccattggacgctgcaacatctccctgcctatggagaatggcttaaactccattgag^ 

ggcgcctaagaaaggaacagattaaaggcaaagtcaccatctttggcaggaagc^ 

tccattgactatgatcgaaatattcggactgaaaagatctatgatgaccaccggaagttcaccctgaggatcat^ 

ccaggtgggccgccccttcctctggctgccct^gcagcgggctggcagctgtcaacgtgtc^ 

tggctgggcttcagcxstggggccatgagcgagaggacagacatcgacaagcaaggccgcatcg 

gacgggaaagtgtggagctactcctaccttgacaagtccatggtcctcctgcttcagagccaacgtcagtatatatttga 

gtatgactcctctgaccgcctccttgccgtcaccatgcccagcgtggcccggcacagcatgtccacacacacctcct^tc^ 

gctacatccgtaatatttacaacccgcctgaaagct^tgcttcggtcatctttgactacagtgatgacggccgcatcc^ 

aagacctcctttttgggcaccggacgccaggtgttctacaagtatgggaaactctccaagttatcagagattgtctacga 

cagtaccgccgtcaccttcgggtatgacgagaccactggtgtcttgaagatggtcaacctccaaagtgggggcttctcct 

gcaccatcaggtaccggaagattggccccctggtggacaagcagatctacaggttctccgaggaaggcatggta^ 

aggtttgactacacctatcatgacaacagcttccgcatcgcaagcatcaagcccgtcataagtgagactcccctccccgt 

tgacctctaccgctatgatgagatttctggcaaggtggaacactttggtaagtttggagtcatctattatgacatcaacc 

agatcatcaccactgccgtgatgaccctcagcaaact^cttcgacacccatgggcggatcaaggaggtccagtatgag^^ 

ttccggtccctcatgtactggatgacggtgcaatatgacagcatgggcagggtgatcaagagggagctaaaactg^^ 

ctatgccaataccacgaagtacacctatgactacgatggggacgggcagctccagagcgtggccgtcaatgaccgcccga 

cctggcgctacagctatgaccttaatgggaatctccacttactgaacccaggcaacagtgtgcgcctcatgcccttgcgc 

tatgacctccgggatcggataaccagactcggggatgtgcagtacaaaattgacgacgatggctatctgtgccagagagg 

gtctgacatcttcgaatacaattccaagggcctcctaacaagagcctactuvcaaggccagcgg^ 

gctatgatggcgtaggacggcgggcttcctacaagaccaacctgggccaccacctgcagtacttctactctgacctccac 

aacccgacgcgcatcacccatgtctacaatct^ctccaactcggagattacctcactgtactacgacctccagggc^ 

ctttgccatggagagcagcagtggggaggagtactatgttgcctctgataacacagggactcctctggctgtgttcagca 

tcaacggcctcatgatctyu^cagctgcagtacacggcctatggggagatttattatgactccaaccccgacttcc^ 

gtcattggcttccatgggggactctatgaccccctgaccaagctggtccacttcactcagcgtgattatgatgtgctggc 

aggacgatggacctccccagactataccatgtggaaaaacgtgggcaaggagccggccccctttaacctgtatatgttca 

agagcaacaatcctctcagcagtgagctagatttgaagaactacgtgacagatgtgaaaagctggcttgtgatgtttg^ 

tttcagcttagcaacatcattcctggcttcccgagagccaa^tgtatttcgtgcctcctccctatgaattgtcagagag 

tcaagcaagtgagaatggacagctcattacaggtgtccaacagacaacagagagacataacca^ 

aaggacaggtcattactaaaaagctccacgccagcatccgagagaaagcaggtcactggtttgccaccaccacgcc 

attggcaaaggcatct^tgtttgccatcaaagaagggcgggtgacmcgggcgtgtccagcatcgccag^^ 

ct^ggtggcatctgtgctgaact^cgcctactacctggacaagatgcactacagcatcgagggcaac^ 

ttgtgaagattggctcagccgatggcgacctggtcacactaggcaccaccatcggccgcaaggtgctagagagcg<^ 

aacgtgaccgtgtcccagcccacgctgctggtcaacggcaggactcgaaggttcacgaacattgagttccagtactccac 

gctgctgctcagcatccgctatggcctcacccccgacaccctggacgaagagaaggcccgcgtcctggaccaggcgagac 

agagggccctgggcacggcctgggccaaggagcagcagaaagccagggacgggagagaggggagccgcctgtggactgag 

ggcgagaagcagcagcttctgagcaccgggcgcgtgcaagggtacgagggatattacgtgcttcccgtggagcaataccc 

agagcttgcagacagtagcagcaacatccagtttttaagacagaatgagatgggaaagaggtaa caaaataatc 

cattccttgtctgaatggctcagcaggagtaactgttatctcctctcctaaggagatgaagacctaacaggggcactgcg 

gctgggctgctttaggagaccaagtggcaagaaagctcacattttttgagttcaaatgctactgtccaagcgagaagtcc 

ctcatcctgaagtagactaaagcccggctgaaaattccgaggaaaacaaaacaaacgaatgaatgaacagaca 

tgttccaagttcccctaaaatatgacccacttgttctgggtctacgcagaaaagagacgcaaagtgtccaaaaggaac^ 

aagaacaaaaacgaataagcaaagaagaaaacaaacaaaaacaaaacaaaacaaac^ 

gaagataagaaagaaggcctcatatccaattacctcactcattcacatgtgagcgacacgcagacatccgcgagggccag 

cgtcaccagaccagctgcgggacaaaccactcagactgcttgtaggacaaatacttctgacattttcgttt;^ 

caggtgcatttaaaacacgactttgggggtgatttgtgtgtagcgcctggggaggggggataaaagaggaggagtgagca 

ctggaaatactttttaaagaaaaaaaj^catgagggaataaaagaaattcctatcaaaaatcaaagtgaaataatacc^ 

ccagcacttaactctcaggtcccaactaagtctggcctgagctaatttatttgagcgcagagtgtaaaatttaattcaaa 

atggtggctataatcactacagatj\aatttcatactcttttgtctttggagattccattgtggacagtaatacgcagtta 

cagggtgtagtctgtttagattccgtagttcgtgggtatcagtttcggtagaggtgcagcatcgtgacacttttgctaac 

aggtaccacttctgatcaccctgtacatacatgagccgaaaggcacaatcactgtttcagatttaaaattattagtgtgt 

ttgtttggtccagaaactgagacaatcacatgacagtcaccacgaggagagaaaatttaaaaaataaaaataaaaacaaa 

aaaaattttaaaaattaaaaaaacaaaaataaagtctaataagaactttggtacaggaacttttttgtaatatacatgta 

tgaattgttcatcgagtttttatattaattttaatttgctgctaagcaaagactagggacaggcaaagataattta 

aaagtgtttaaattgtttatacataaataaagtctctaaaactcctgtg 
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D NO: 12 is 2724 amino 



31. This sequence differs 



Table 31. Encoded FCTR3f protein sequence (SEQ ID NO:13) 



mdvkdrrhrsltrgrcgkecrytsssldsedcrvptqksysssetlkaydhdsrmhyg^frvtdlihresdefprcx5tnfti^ 

cepsphrsgycsdmgilhqgyslstgsdadsdteggmspehairlwgrgiksrrssglssrensaltltdsdnenksddengrpip 

ptsspsllpsaqlpsshnpppvscqmplldsntshqimdtnpdeefspnsyllracsgpqqasssgppnhhsqstlrpplppphnh 

tlshhhssanslnrnsltnrrsqihapapapntolattpesvqlqdswvlnswpletrhflfct^ 

ytppprllprntfsrkafklkkpskycswkcaalsaiaaalllaillayfivpwslknssidsgeaevgrrvtqevppgv 

hisqpqflkfnislgkdalfgvyirrglppshaqydfmerlix5kekwsvvesprerrsiqtlvqneavfvqyldvglwhi^ 

kdkeiwsfntvvldsvqdcprnchgngecvsgvchcfpgflgadcakaacpvlcsgngqyskgtcqcysgwkga^ 

scgghgscidgncvcsagykgehceevdcldptcsshgvcwgeclcspgwgglncelarvqcpdqcsghgtylpdtglcscdp^^ 

mgpdcsvevcsvdcgthgvciggacrceegwtgaacdqrvchprciehgtckdgkcecregwngehctidgcpdlcngngrctlgq 

nswqcvcqtgwrgpgcotametscadnkdnegdglvdcldpdcclqsacqnsllcrgsrdpldiiqqgqtdwpavksfydrik^ 

gkdsthiipgenpfnsslvslirgqvvttdgtplvgvnvsfvkypkygytitrqdgtfdlianggasltijiferap™^ 

pwnsfyaivldtlvmkteensipscdlsgfvrpdpiiissplstffsaapgqnpivpetqvlheeielpgsnvklrylssrtagyk^ 

LKITMTQSWPLNLIRVHLMVAVEGHLFQKSFQASPNIASTFIWDKTDAYGQRVYGLSDAWSVGFEYETCPSLIL^^ 

feldpsnlggwsldkhhilnvksgii^kgtgenqfltqqpaiitsimgngrrrsiscpscngi^ 

dfnyirrifpsrnvtsilelrnkefkhs^^^pahkyylavdpvsgslyvsdtnsrriyrvkslsgtkdlagnsewa 

earcgdggkaidatmsprgiavdknglmyfvdatmirkvdqngiistlixssndltawplscdssmdvaqvrlewptd^ 

nslyvlennvilritenhqvsiiagrpmhcqvpgidyslsklaihsalesasaiaishtgvlyitetdekkii^lrqvttng^ 

lagaasdcdcktowcncysgddayatdailnspsslavapdgtiyiadlgniriravsknkpvlna™ 

dgihqytvslvtgeylynftystdndvtelidnngnslkirrdssgmprhllmpdnqiitltvgtngglkwstqnlei^ 

NTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYT^ 
a^GTLRVMYANGMG I S FHS E PHVIAGT I TPT IGROT I S LPMENGLNS I EWRLRKEQ I KGKVT I FG^ 
TEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWS 

lllqsqrqyifeydssdrllavtmpsvarhsmsthtsigyirniynppesnasvifdysdix3rilktsflgtgrqvfykygkxskl 
seivydstavtfgydettgvlkmvnlqsggfsctiryrkigplvdkqiyrfseegmvnarfdytyhdnsfriasikpvisetplpv 
dlyrydeisgkvehfgkfgviyydinqiittavmtlskhfdthgrikevqyemfrslmywmtvqydsmgrvikrelklgpyant^ 

YTYDYDGDGQLQSVAVNDRPTWRYSYDIJ^GNLHLLNPGNSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLL 

TRAYNKASGWSVQYRYDGVGRRASYKTNIX3HHLQYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGHLFAMESSSGEEYYVAS 

GTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQIWIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPF^ 

MFKSIWPLSSELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENGQLITGVQQTTERHNQAFN^ 

ITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGRWTGVSSIASEDSRKVASVLNNAYYLDKMHYSIEGKI)THYFVK^ 

LVTLGTTIGRKVLESGVNVTVSQPTLLWGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQARQRALGTAWAK^ 

DGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 



In a BLASTN search it was found that the FCTR3a nucleic acid has homology to three 
fragments of Mus musculus odd Oz/ten-m homolog 2. It has 634 of 685 bases (92%) identical to 



bases 614-1298, 365 of 406 bases (89%) identical to bases 1420-1825, and 93 of 103 bases 
(90%) identical to bases 1823-1925 of Mus musculus odd Oz/ten-m homolog 2 (GenBank Acc: 
NM_0 11856.2) (Table 3 J). 

Table 3 J. BLASTN of FCTR3a against Mus musculus odd Oz/ten-m homolog 2 (SEQ ID 

NO:62) 



> GI|7657414|REF|NM 011856.21 MUS MUSCULUS ODD OZ/TEN-M HOMOLOG 2 (DROSOPHILA) (0DZ2) , 
MRNA 



SCORE = 954 BITS (481), EXPECT = 0.0 
IDENTITIES ^ 634/685 (92%) 
STRAND = PLUS / PLUS 

QUERY : 114 GGTCGTCCCATTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGC 173 

IIIIIIIIIIIIIIIIIMIIIMIM MM IMMMMMMMMMMMMM 

SBJCT: 614 GGTCGTCCCATTCCACCTACATCCTCGTCTAGCCTCCTCCCATCTGCTCAGCTGCCTAGC 673 



LENGTH =8797 
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QUERY: 174 



SBJCT: 674 



QUERY: 234 



SBJCT: 734 



TCCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAi 

IMIIIIII^IIIIMMIMIIIIIIIIMMIIIIIII 

TCCCATAAT^BiCACCAGTTAGCTGCCAGATGCCATTGCTAGAi 



.CAGgAA( 



.CACCTCCCAT 233 

llllllllll 
.CACCTCCCAT 733 



293 



CAAATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCAGAGCA 

II IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIillMMIIIIIIII 
CAGATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCAGAGCA 793 



QUERY : 294 TGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACCACAGCCAGTCGACT 

llllllllllllllllllllllllllllllllllllll IIIIIIIMIIIIIIII II 
SBJCT: 794 TGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCAAACCACCACAGCCAGTCAACA 



353 



853 



QUERY : 354 CTGAGGCCCCCTCTCCCACCCCCTCACAACCACACGCTGTCCCATCACCACTCGTCCGCC 413 

IMIIIIIIIIIII lllllllllll IIIMIII llllllll llllllll II III 

SBJCT: 854 CTGAGGCCCCCTCTGCCACCCCCTCATAACCACACCCTGTCCCACCACCACTCCTCGGCC 913 

QUERY: 414 AACTCCCTCAACAGGAACTCACTGACCAATCGGCGGAGTCAGATCCACGCCCCGGCCCCA 473 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII IIIMIIIIII II II 
SBJCT: 914 AACTCCCTCAACAGGAACTCACTGACCAATCGGCGGAGTCAAATCCACGCCCCAGCTCCT 973 

QUERY: 474 GCGCCCAATGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAGGACAGCTGGGTGCTA 533 

llllllll llllllllllllll llllllll IIIMIII MM! lllllllllll 

SBJCT: 974 GCGCCCAACGACCTGGCCACCACCCCAGAGTCTGTTCAGCTCCAGGATAGCTGGGTGCTG 1033 

QUERY: 534 AACAGCAACGTGCCACTGGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACA 593 

mil Mill lllllllllll lllllllllll Mill II II II II llllll 
SBJCT: 1034 AACAGTAACGTCCCACTGGAGACTCGGCACTTCCTTTTCAAAACGTCGTCTGGAAGCACA 1093 

QUERY: 594 CCCTTGTTCAGCAGCTCTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCC 653 

III llllllllllllllll lllllllllllilllMIIIIM II Mill II II 

SBJCT: 1094 CCCCTGTTCAGCAGCTCTTCTCCGGGATACCCTTTGACCTCAGGGACCGTTTATACACCA 1153 
QUERY: 654 CCGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAGGCTTTCAAGCTGAAGAAGCCC 713 

II llllllllllllll IIMIII IIMIIIIMIIII llllllllllllll III 

SBJCT: 1154 CCACCCCGCCTGCTGCCACGGAATACATTCTCCAGGAAGGCCTTCAAGCTGAAGAAACCC 1213 

QUERY: 714 TCCAAATACTGCAGCTGGAAATGTGCTGCCCTCTCCGCCATTGCCGCGGCCCTCCTCTTG 773 

llllllllllllll lllllllllllllllll II IIIM Mill llllllllllll 
SBJCT: 1214 TCCAAATACTGCAGTTGGAAATGTGCTGCCCTGTCTGCCATCGCCGCCGCCCTCCTCTTG 1273 

QUERY: 774 GCTATTTTGCTGGCGTATTTCATAG 798 

II IIMIIIIIIi llllllllll 
SBJCT: 1274 GCCATTTTGCTGGCATATTTCATAG 12 98 



SCORE = 480 BITS (242) , EXPECT 
IDENTITIES = 365/406 (89%) 
STRAND = PLUS / PLUS 



E-132 



QUERY: 7 97 AGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCG 856 

IIIMIIIIIII IIIIIIIIIIIIIIIIIIMIIIIM llllllllllllllllllll 
SBJCT: 1420 AGTGCCCTGGTCATTGAAAAACAGCAGCATAGACAGTGGCGAAGCAGAAGTTGGTCGGCG 14 79 

QUERY: 857 GGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCC 916 

III IIIM IIIIIIIIIIIIIIIIIIIIIMIIIIII II lllllllllllllllli 
SBJCT: 14 80 GGTGACACAGGAAGTCCCACCAGGGGTGTTTTGGAGGTCCCAGATTCACATCAGTCAGCC 1539 

QUERY: 917 CCAGTTCTTAAAGTTCAACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAG 976 

II IMMMMMMMMMIIM II Mill II Mill Mill II Mill 

SBJCT: 1540 TCAATTCTTAAAGTTCAACATCTCCCTGGGCAAGGATGCCCTCTTCGGTGTCTATATAAG 1599 

QUERY: 977 AAGAGGACTTCCACCATCTCATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGA 1036 

llllllll Mill IlillMIIIIIIIIIIIIIIIIIIIIII Mill II Mill 
SBJCT: 1600 GAGAGGACTACCACCGTCTCATGCCCAGTATGACTTCATGGAACGCCTGGATGGAAAGGA 1659 

QUERY: 1037 GAAGTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAA 1096 

III Mill Mill Mill llllllllllllllllllll Mill MM Mill 

SBJCT: 1660 GAAATGGAGCGTGGTCGAGTCGCCCAGGGAACGCCGGAGCATCCAGACTCTGGTGCAGAA 1719 



QUERY: 1097 TGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGA 1156 
II II lllllllllllllll lllllllllllllllllll lllllllllllllllll 
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SBJCT: 1720 CGAGGCTGTGTTTGTGCAGTACTTGGATGTGGGCCTGTGGCACC 



QUERY: 1157 TGGAAAAGA^BIgAGATGGTTTCCTTCAATACTGTTGTCCTAGA'! 

II II IImTiIIIIIII llllllll lllllllll Mill 

SBJCT: 1780 CGGCAAGGACAAGGAGATGGTCTCCTTCAACACTGTTGTCTTAGAT 1825 



ITG^jCT 
A^^R) 



TTCTACAATGA 1779 



SCORE = 125 BITS (63), EXPECT = 7E-26 
IDENTITIES = 93/103 (90%) 
STRAND = PLUS / PLUS 

QUERY: 1258 GATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTG 1317 

lilllMMIIMIIIMIIIII Mill II illll IMIIIII Mill II II 

SBJCT: 1823 GATTCAGTGCAGGACTGTCCACGGAACTGTCACGGGAACGGTGAATGCGTGTCTGGACTG 1882 

QUERY: 1318 TGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAG 1360 

Illlllllllllllllllll Illll llllllllllllllll 
SBJCT: 1883 TGTCACTGTTTCCCAGGATTCCTAGGTGCAGACTGTGCTAAAG 1925 



In another BLASTN search it was found that the FCTR3a nucleic acid has homology to 
three fragments of Gallus gallus mRNA for teneurin-2. It has 541 of 629 bases (86%) identical 
to bases 502-1 130, 302 of 367 bases (82%) identical to bases 1330-1696, and 87 of 103 bases 
(84%) identical to bases 171 1-1813 of Gallus gallus mRNA for teneurin-2 (EMBL Acc: 
AJ24571 1.1) (Table 3K). 

Table 3K. BLASTN of FCTR3a against Gallus gallus mRNA for teneurin-2 (SEQ ID 

NO:63) 

> GI I 6010048 I EMB I AJ245711 . 1 1 GGA245711 GALLUS GALLUS MRNA FOR TENEURIN-2, SHORT SPLICE 
VARIANT (TEN2 GENE) 

LENGTH = 24 96 



SCORE = 549 BITS (277), EXPECT 
IDENTITIES = 541/629 (86%) 
STRAND = PLUS / PLUS 



E-153 



QUERY: 


114 


SBJCT : 


502 


QUERY: 


174 


SBJCT : 


562 


QUERY: 


234 


SBJCT : 


622 


QUERY: 


294 


SBJCT : 


682 


QUERY: 


354 


SBJCT : 


742 


QUERY: 


414 


SBJCT : 


802 


QUERY: 


474 


SBJCT : 


862 



GGTCGTCCCATTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGC 173 

lllllllllllllllllllllllllll MM M MMMMMMMMMM M 

GGTCGTCCCATTCCACCTACATCCTCGTCTAGCCTTCTCCCATCTGCTCAGCTGCCCAGT 561 
TCCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCAT 233 

M MMMMMMMMMMMMMMMMMMMMMMMI M MMM 

TCTCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAATACGTCCCAT 621 
CAAATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCAGAGCA 2 93 

MMMMMMMMI MMI MMI MMI M MMMMMI M MMM 

CAAATCATGGACACCAATCCTGACGAGGAGTTCTCTCCTAATTCATACCTACTAAGAGCA 681 
TGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACCACAGCCAGTCGACT 353 

M MMMM MMI M MMMMMMMI I MMI IMMMIIM II 

TGTTCAGGGCCACAGCAGGCATCCAGCAGTGGCCCTTCAAACCATCACAGCCAGTCAACG 741 



CTGAGGCCCCCTCTCCCACCCCCTCACAACCACACGCTGTCCCATCACCACTCGTCCGCC 

MMMM MMMM II llllllllllll MMMMIMM MMMM III 

CTGAGGCCACCTCTCCCCCCTCCTCACAACCACTCGCTGTCCCATCATCACTCGTCTGCC 



413 



801 



AACTCCCTCAACAGGAACTCACTGACCAATCGGCGGAGTCAGATCCACGCCCCGGCCCCA 4 73 

Mllllllllllllllllll II Illll II II I lllllllllll li II M 

AACTCCCTCAACAGGAACTCGCTCACCAACCGCCGCAACCAGATCCACGCGCCTGCTCCC 861 
GCGCCCAATGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAGGACAGCTGGGTGCTA 533 

II llllllllllllll Illll II Illll II Mill MMMMMMMMI 

GCTCCCAATGACCTGGCGACCACGCCTGAGTCTGTGCAGCTGCAGGACAGCTGGGTGCTC 921 
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QUERY: 


534 


SBJCT: 


922 


QUERY: 


594 


SBJCT : 


982 


QUERY: 


654 


SBJCT : 


1042 


QUERY: 


714 


SBJCT : 


1102 



T^CAGCAACj^^CACTGGAGACCCGGCACTTCCTCTTCAAGACC^^pGGGGAGCACA 

IllllilllWI MIIMIM MM Ml I M MMI Wl Ml M 

AACAGCAACGTGCCGCTGGAGACCAGGCATTTCTTGTTTAAGACATCTTCTGGAACGACT 



593 



981 



653 



CCCTTGTTCAGCAGCTCTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCC 

M IMIIII IIIIMIIIII II lllll IIIIMIIIIIII lllll II II 

CCGCTGTTCAGTAGCTCTTCCCCTGGCTACCCACTGACCTCAGGAACAGTTTATACTCCA 1041 
CCGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAGGCTTTCAAGCTGAAGAAGCCC 713 

II III I III I II II Mill II IMIIM! II IIIIMIIIII llllll 

CCTCCCAGGCTGTTACCTAGAAATACATTTTCCAGGAATGCATTCAAGCTGAAAAAGCCC 1101 

TCCAAATACTGCAGCTGGAAATGTGCTGC 742 

lllll II II lllllllllllllllll 
TCCAAGTATTGTAGCTGGAAATGTGCTGC 1130 



SCORE = 212 BITS (107), EXPECT = 4E-52 
IDENTITIES = 302/367 (82%) 
STRAND = PLUS / PLUS 



QUERY: 


819 


SBJCT : 


1330 


QUhRY : 


879 


SBJCT : 


1390 


QUERY: 


939 


SBJCT : 


1450 


QUERY: 


999 


SBJCT: 


1510 


QUERY: 


1059 


SBJCT: 


1570 


QUERY: 


1119 


SBJCT : 


1630 


QUERY: 


1179 


SBJCT : 


1690 


SCORE 


= 77 


IDENTITIES 



AGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCGGGTAACACAAGAAGTCCCACCA 878 

lllllllllll MMI III llllllllll II Ml II lllll II II II 

AGCAGCATAGATAGTGGAGAAACAGAAGTTGGCCGCAAGGTCACCCAAGAGGTGCCCCCT 13 89 
GGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCCCCAGTTCTTAAAGTTCAACATC 93 8 

II Mill III MM II II II Mill lllll llllll I lllllllllll 

GGAGTGTTCTGGCGGTCTCAGATCCATATCAGCCAGCCACAGTTCCTGAAGTTCAACATA 1449 



TCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCAT 

Mill Mllllll lllll II llllllll IIIIIIIIIIIMI llllllll III 

TCCCTAGGGAAGGATGCTCTTTTCGGTGTTTATATAAGAAGAGGACTCCCACCATCACAT 



998 



1509 



GCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCT 1058 

M llllllll IIIIIIIMII MM lllll lllll lllllllllll II II 

GCACAGTATGATTTCATGGAACGCTTGGATGGGAAAGAGAAATGGAGTGTGGTGGAATCC 1569 

CCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTAC 1118 

II lllllll M II II Mill I lllllllllll II llllllll llllll 
CCACGGGAACGGCGAAGTATTCAGACTCTTGTTCAGAATGAGGCTGTGTTTGTTCAGTAC 1629 

CTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGATGGAAAAGACAAAGAGATGGTT 1178 

llllllllll lllllll lllll M IIIMIMIII II llllllll MM 

TTGGATGTGGGTTTGTGGCACCTGGCGTTTTACAATGATGGCAAGGACAAAGAAGTGGTC 1689 



lllllll 



STRAND 



87/103 (84%) 
PLUS / PLUS 



QUERY: 1258 GATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTG 1317 

IIMIIIMII llllllllllllll II lllll lllll II lllll II II II 
SBJCT: 1711 GATTCAGTGCAAGACTGTCCACGTAATTGTCATGGCAATGGCGAGTGTGTTTCTGGTGTC 1770 

QUERY: 1318 TGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAG 1360 

II IIIIIMI M IIIIMI llllllll llllllllll 
SBJCT: 1771 TGCCACTGTTTTCCCGGATTTCATGGAGCAGATTGTGCTAAAG 1813 



In this search it was also found that the fragments of FCTR3bcd and e nucleic acids had 
homology to three fragments of Homo sapiens mRNA for KI/VAl 127 protein. It has 5537 of 
5538 bases (99%) identical to bases 1-5538, 705 of 714 bases (98%) identical to bases 5609- 
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6322, and 176 of 176 ba^(100%) identical to bases 6385-656q^^omo sapiens mRNA for 
KIAAl 127 protein (GeWik Acc: AB032953) (Table 3L). 



Table 3L. BLASTN of FCTR3b, c, d, and e against Homo sapiens KIAAl 127 mRNA (SEQ 

ID NO:64) 

5 > GI I 6329762 I DBJIAB032953 ■ 1 IAB032953 HOMO SAPIENS MRNA FOR KIAA1127 PROTEIN, PARTIAL 
CDS 

LENGTH = 6560 

SCORE = 1.097E+04 BITS (5534), EXPECT = 0.0 
10 IDENTITIES = 5537/5538 (99%) 

STRAND = PLUS / PLUS 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



QUERY: 


3267 


SB JCT : 


1 


QUERY: 


3327 


SB JCT : 


61 


QUERY: 


3387 


SB JCT : 


121 


QUERY : 


3447 


SB JCT : 


181 


QUERY: 


3507 


SB JCT : 


241 


QUERY: 


3567 


SB JCT : 


301 


QUERY : 


3627 


SBJCT : 


361 


QUERY: 


3687 


SBJCT : 


421 


QUERY : 


3747 


SBJCT : 


481 


QUERY: 


3807 


SBJCT : 


541 


QUERY: 


3867 


SBJCT: 


601 


QUERY: 


3927 


SBJCT : 


661 


QUERY: 


3987 


SBJCT : 


721 


QUERY: 


4047 



CACCTTCTTTAGTGCTGCCCCTGGGCAGAATCCCATCGTGCCTGAGACCCAGGTTCTTCA 3326 

IIIIIMIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CACCTTCTTTAGTGCTGCCCCTGGGCAGAATCCCATCGTGCCTGAGACCCAGGTTCTTCA 6 0 
TGAAGAAATCGAGCTCCCTGGTTCCAATGTGAAACTTCGCTATCTGAGCTCTAGAACTGC 3386 

IIIIMIIIIMIIIIilMllllllllllllllllllllllllMlllllllillllll 

TGAAGAAATCGAGCTCCCTGGTTCCAATGTGAAACTTCGCTATCTGAGCTCTAGAACTGC 120 
AGGGTACAAGTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTGAACCTCAT 344 6 

llliMlllillllllilllllllllllllllMllllllllllilllllllllllllll 

AGGGTACAAGTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTGAACCTCAT 180 

TAGGGTTCACCTGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTC 3506 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIillllllllllllllllllMI 
TAGGGTTCACCTGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTC 240 

TCCCAACCTGGCCTCCACCTTCATCTGGGACAAGACAGATGCGTATGGCCAAAGGGTGTA 3566 

IIIIMIIMMII illlllllilllllllllllllMllllllllllllllllllill 

TCCCAACCTGGCCTACACCTTCATCTGGGACAAGACAGATGCGTATGGCCAAAGGGTGTA 300 
TGGACTCTCAGATGCTGTTGTGTCTGTCGGGTTTGAATATGAGACCTGTCCCAGTCTAAT 3 626 

MilllillMIIIIMIIIIIIMIIIIIIIIIIirMIIIIIIIIIMIIIIIIilil 

TGGACTCTCAGATGCTGTTGTGTCTGTCGGGTTTGAATATGAGACCTGTCCCAGTCTAAT 360 

TCTCTGGGAGAAAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAACCTCGG 3686 

lllllllllllllilllillllllllllllllllllllllllllllllllllllllllll 
TCTCTGGGAGAAAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAACCTCGG 420 

TGGCTGGTCCCTAGACAAACACCACATCCTCAATGTTAAAAGTGGAATCCTACACAAAGG 3746 

IIIIIIIIIIMMIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIilllMIIII 

TGGCTGGTCCCTAGACAAACACCACATCCTCAATGTTAAAAGTGGAATCCTACACAAAGG 480 
CACTGGGGAAAACCAGTTCCTGACCCAGCAGCCTGCCATCATCACCAGCATCATGGGCAA 3806 

IIIIMIillllllllllllllllllllllllilllMIIIIIIIIIIIIIIIIMMII 

CACTGGGGAAAACCAGTTCCTGACCCAGCAGCCTGCCATCATCACCAGCATCATGGGCAA 540 
TGGTCGCCGCCGGAGCATTTCCTGTCCCAGCTGCAACGGCCTTGCTGAAGGCAACAAGCT 3866 

IIIIIIIMIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIII 

TGGTCGCCGCCGGAGCATTTCCTGTCCCAGCTGCAACGGCCTTGCTGAAGGCAACAAGCT 600 

GCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGACTTCAA 3 926 

lllllllllllilllllllllllllllllillllllllllllllllllllMIIIIMII 
GCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGACTTCAA 660 

TTACATCCGACGCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGAAATAA 3986 

illlllMMIilllllllilllllMMIIIIMIIIIIIIIIIMIIIIIIIIIIMI 

TTACATCCGACGCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGAAATAA 720 
AGAGTTTAAACATAGCAACAACCCAGCACACAAGTACTACTTGGCAGTGGACCCCGTGTC 4 046 

IIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 

AGAGTTTAAACATAGCAACAACCCAGCACACAAGTACTACTTGGCAGTGGACCCCGTGTC 780 
CGGCTCGCTCTACGTGTCCGACACCAACAGCAGGAGAATCTACCGCGTCAAGTCTCTGAG 4106 
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SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY : 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY : 
SBJCT : 
QUERY: 
SBJCT : 
QUERY : 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 



781 



IIIMM 
CGGCTCGC 



imi 



iiiiiiiiiiMiiiiiiiiiiiiiiiiiiii 

;gtgtccgacaccaacagcaggagaatctacc( 



ILUlll 
;G^Bf^( 



IIIIIIM 

tGTCTCTGAG 



840 



4107 TGGAACCAAAGACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCT 4166 

IIIIMIIIIIIMIIIIilllllllllilMIIIMIMIIIIIMIIIIIilMilM 

TGGAACCAAAGACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCT 



841 



900 



4167 ACCCTTTGATGAAGCCCGCTGCGGGGATGGAGGGAAGGCCATAGATGCAACCCTGATGAG 4226 

liMIIMIMIIIIIIIIIMIIIIIIIIIIMIMIIIMIIIIIIIIIIIIIIIIII 

901 ACCCTTTGATGAAGCCCGCTGCGGGGATGGAGGGAAGGCCATAGATGCAACCCTGATGAG 960 
4227 CCCGAGAGGTATTGCAGTAGACAAGAATGGGCTCATGTACTTTGTCGATGCCACCATGAT 4286 

IIMIIMIMIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIillllllllllllllll 

961 CCCGAGAGGTATTGCAGTAGACAAGAATGGGCTCATGTACTTTGTCGATGCCACCATGAT 1020 
4287 CCGGAAGGTTGACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAATGACCTCACTGC 434 6 

IIIIIIIIIIIIIIMMIillllllllllillllllllllllillilllllllllMII 

1021 CCGGAAGGTTGACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAATGACCTCACTGC 1080 
4347 CGTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGAGTGGCC 4406 

IIIIIMIIIIMIIIIIIIIilllllMIMIIIIIIIIIMIIIIIIIIIIMIMIi 

1081 CGTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGAGTGGCC 1140 
44 07 AACAGACCTTGCTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCAT 4466 

liMIMIIMIIMIIIIIIIIMIIIIIIMIIIIillllllllllllllllllllll 

1141 AACAGACCTTGCTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCAT 12 00 

4467 CCTTCGAATCACCGAGAACCACCAAGTCAGCATCATTGCGGGACGCCCCATGCACTGCCA 4526 

IIIIIIMIIMMIIIIIIIIIIIIilllllllllllllllllllllllllllllllll 
1201 CCTTCGAATCACCGAGAACCACCAAGTCAGCATCATTGCGGGACGCCCCATGCACTGCCA 1260 

4527 AGTTCCTGGCATTGACTACTCACTCAGCAAACTAGCCATTCACTCTGCCCTGGAGTCAGC 4586 

IIIIMIilllllllllMIIIIIIIIIIIIMIIIIIIIIIIIIIIilllllMIIIII 

1261 AGTTCCTGGCATTGACTACTCACTCAGCAAACTAGCCATTCACTCTGCCCTGGAGTCAGC 1320 
4 587 CAGTGCCATTGCCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATGAGAAGAA 4646 

IIIIIIMIIIIIIIillllllllllilMlllllllllllllllllllllllllllill 

1321 CAGTGCCATTGCCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATGAGAAGAA 13 80 
4 647 GATTAACCGTCTACGCCAGGTAACAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGC 4706 

llllillllMIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIMMI 

13 81 GATTAACCGTCTACGCCAGGTAACAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGC 1440 

4 707 CTCGGACTGCGACTGCAAAAACGATGTCAATTGCAACTGCTATTCAGGAGATGATGCCTA 4766 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIillllllll 
1441 CTCGGACTGCGACTGCAAAAACGATGTCAATTGCAACTGCTATTCAGGAGATGATGCCTA 1500 

4 767 CGCGACTGATGCCATCTTGAATTCCCCATCATCCTTAGCTGTAGCTCCAGATGGTACCAT 4826 

llllllllllllllllllllllllllllllllllllllllllillllllllllllilill 

1501 CGCGACTGATGCCATCTTGAATTCCCCATCATCCTTAGCTGTAGCTCCAGATGGTACCAT 1560 

4 82 7 TTACATTGCAGACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAACAAGCCTGTTCT 4886 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIII 
1561 TTACATTGCAGACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAACAAGCCTGTTCT 1620 

4 887 TAATGCCTTCAACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAA 4946 

MIIIIIMIillMIIMIMIIIIIIMMIIIIIIIIIIIMIIIIIIIIMIIIII 

1621 TAATGCCTTCAACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAA 1680 
4 94 7 CGCTGATGGCATCCACCAATACACTGTGAGCCTGGTGACAGGGGAGTACTTGTACAATTT 5006 

IIIIIMIIIMIIIIIIIIIIilllllllllilllMIIIIIIIIIIIIIIIIIIIIM 

1681 CGCTGATGGCATCCACCAATACACTGTGAGCCTGGTGACAGGGGAGTACTTGTACAATTT 174 0 

5007 CACATATAGTACTGACAATGATGTCACTGAATTGATTGACAATAATGGGAATTCCCTGAA 5066 

illllllllllllllllllllMIIIIIIIIIIIIIIMIMIIIIIMIIIIIIIIIII 
1741 CACATATAGTACTGACAATGATGTCACTGAATTGATTGACAATAATGGGAATTCCCTGAA 1800 

5067 GATCCGTCGGGACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGACAACCAGATCAT 5126 

IIIIIIIIIIMMMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 
1801 GATCCGTCGGGACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGACAACCAGATCAT 1860 
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QUERY: 


5127 


SBJCT: 


1861 


QUERY: 


5187 


SBJCT : 


1921 


QUERY: 


5247 


SBJCT : 


1981 


QUERY: 


5307 


SBJCT : 


2041 


QUERY: 


5367 


SBJCT : 


2101 


QUERY: 


5427 


SBJCT : 


2161 


QUERY: 


5487 


SBJCT : 


2221 


QUERY : 


5547 


SBJCT : 


2281 


QUERY: 


5607 


SBJCT : 


2341 


QUERY: 


5667 


SBJCT: 


2401 


QUERY: 


5727 


SBJCT : 


2461 


QUERY: 


5787 


SBJCT : 


2521 


QUERY: 


5847 


SBJCT : 


2581 


QUERY: 


5907 


SBJCT : 


2641 


QUERY: 


5967 


SBJCT : 


2701 


QUERY: 


6027 


SBJCT: 


2761 


QUERY : 


6087 


SBJCT : 


2821 


QUERY : 


6147 



CACCCTCAO^^GGCACCAATGGAGGCCTCAAAGTCGTGTCCAC^^^^CCTGGAGCT 5186 

IIIIMIlWlllllllllllllMlllillllllillllllWlllllllllll 

CACCCTCACCGTGGGCACCAATGGAGGCCTCAAAGTCGTGTCCACACAGAACCTGGAGCT 1920 



5246 



TGGTCTCATGACCTATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGG 

IIIIIIIIIIIIIIIIIIMMIIIMMIIIIIIMIIIIMIMIMIIIIMIIill 

TGGTCTCATGACCTATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGG 1980 

ATGGACGACTTTCTATGACTATGACCACGAAGGCCGCCTGACCAACGTGACGCGCCCCAC 5306 

IIIIIIIIIIIIIIMIIIilllllllllllllllllllllllllllllillllilMII 
ATGGACGACTTTCTATGACTATGACCACGAAGGCCGCCTGACCAACGTGACGCGCCCCAC 2040 

GGGGGTGGTAACCAGTCTGCACCGGGAAATGGAGAAATCTATTACCATTGACATTGAGAA 5366 

lllllllllllllllilllllllMllllillllMIIIIIIIIIIIIIIIIIIIIIIII 
GGGGGTGGTAACCAGTCTGCACCGGGAAATGGAGAAATCTATTACCATTGACATTGAGAA 2100 

CTCCAACCGTGATGATGACGTCACTGTCATCACCAACCTCTCTTCAGTAGAGGCCTCCTA 5426 

MIIIIIIIIIMIilMIIIIIIIIMIillllllMIMMIIIIIIIIIIIIIIIII 

CTCCAACCGTGATGATGACGTCACTGTCATCACCAACCTCTCTTCAGTAGAGGCCTCCTA 2160 
CACAGTGGTACAAGATCAAGTTCGGAACAGCTACCAGCTCTGTAATAATGGTACCCTGAG 54 86 

IIIIIIMIIIIMMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CACAGTGGTACAAGATCAAGTTCGGAACAGCTACCAGCTCTGTAATAATGGTACCCTGAG 2220 

GGTGATGTATGCTAATGGGATGGGTATCAGCTTCCACAGCGAGCCCCATGTCCTAGCGGG 5546 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
GGTGATGTATGCTAATGGGATGGGTATCAGCTTCCACAGCGAGCCCCATGTCCTAGCGGG 2280 

CACCATCACCCCCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAA 5606 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CACCATCACCCCCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAA 2340 

CTCCATTGAGTGGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAG 5666 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIillllllllllllllllllll 
CTCCATTGAGTGGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAG 2400 

GAAGCTCCGGGTCCATGGAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGAC 5726 

IIIIMIIIIIMIIIIIIilllllMllllllllllllllillilllMIIIIIIIIII 

GAAGCTCCGGGTCCATGGAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGAC 2460 

TGAAAAGATCTATGATGACCACCGGAAGTTCACCCTGAGGATCATTTATGACCAGGTGGG 5786 

IIIIIIIIIIIIIMIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
TGAAAAGATCTATGATGACCACCGGAAGTTCACCCTGAGGATCATTTATGACCAGGTGGG 2520 

CCGCCCCTTCCTCTGGCTGCCCAGCAGCGGGCTGGCAGCTGTCAACGTGTCATACTTCTT 5846 

IIIIIIIIIMIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

CCGCCCCTTCCTCTGGCTGCCCAGCAGCGGGCTGGCAGCTGTCAACGTGTCATACTTCTT 2580 
CAATGGGCGCCTGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAGACATCGACAA 5906 

IIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIillllllllMllllillllll 

CAATGGGCGCCTGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAGACATCGACAA 2640 
GCAAGGCCGCATCGTGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCT 5966 

MllllllllllllilllllllMlllllilllllllllllllMIIIIIIIIMIIIII 

GCAAGGCCGCATCGTGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCT 2 700 
TGACAAGTCCATGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGAGTATGACTC 6026 

llllillllilillMlllllllillMIIIIIIIMIIMIMIIIIIIIIIIIMIII 

TGACAAGTCCATGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGAGTATGACTC 2760 
CTCTGACCGCCTCCTTGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACA 6086 

ililMllllllllllllllllllllllillllliMlllllllilllMlllllllill 

CTCTGACCGCCTCCTTGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACA 2820 
CACCTCCATCGGCTACATCCGTAATATTTACAACCCGCCTGAAAGCAATGCTTCGGTCAT 6146 

IIIIIMIIIIIIIilllllllllllllMlllllllilllllllllilMilllllMI 

CACCTCCATCGGCTACATCCGTAATATTTACAACCCGCCTGAAAGCAATGCTTCGGTCAT 2880 
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SB JCT : 


2881 


QUERY: 


6207 


SB JCT : 


2941 


QUERY: 


6267 


SB JCT : 


3001 


QUERY: 


6327 


SB JCT : 


3061 


QUERY: 


6387 


SB JCT : 


3121 


QUERY: 


6447 


SB JCT: 


3181 


QUERY: 


6507 


SB JCT : 


3241 


QUERY : 


6567 


SB JCT : 


3301 


QUERY: 


6627 


SB JCT : 


3361 


QUERY: 


6687 


SB JCT : 


3421 


QUERY: 


6747 


SB JCT : 


3481 


QUERY: 


6807 


SB JCT : 


3541 


QUERY: 


6867 


SB JCT : 


3601 


QUERY: 


6927 


SB JCT : 


3661 


QUERY: 


6987 


SB JCT : 


3721 


QUERY : 


7047 


SB JCT : 


3781 


QUERY: 


7107 






QUERY: 


7167 


SB JCT : 


3901 



IlillllllL 

CTTTGACT. 




lllllllllllllilllllllllMIIIM 
.TGACGGCCGCATCCTGAAGACCTCCTTTT' 



I LLU I 
T]^WV( 



llllllllll 
CCGGACGCCA 294 0 



GGTGTTCTACAAGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTACGACAGTACCGC 6266 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 
GGTGTTCTACAAGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTACGACAGTACCGC 3000 

CGTCACCTTCGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTGG 6326 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIilllMIIIIIIIIIIII 

CGTCACCTTCGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTGG 3060 
GGGCTTCTCCTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTA 6386 

IIIIMIIIIIIIIIIIIIIilllllllllllllllllllllllllllllllllllllM 

GGGCTTCTCCTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTA 3120 

CAGGTTCTCCGAGGAAGGCATGGTCAATGCCAGGTTTGACTACACCTATCATGACAACAG 6446 

lllllllllllillllllllllllllllllllllllMIIMIIIIIIIIIIIIIIIIII 
CAGGTTCTCCGAGGAAGGCATGGTCAATGCCAGGTTTGACTACACCTATCATGACAACAG 3180 

CTTCCGCATCGCAAGCATCAAGCCCGTCATAAGTGAGACTCCCCTCCCCGTTGACCTCTA 6506 

llillllllllllllllllllllllllllllllllllllllllllillllllllllllll 
CTTCCGCATCGCAAGCATCAAGCCCGTCATAAGTGAGACTCCCCTCCCCGTTGACCTCTA 324 0 

CCGCTATGATGAGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCATCTATTA 6566 

llllillllllllllllllllllllllllllllllMllllllllllillMIIIIIIII 

CCGCTATGATGAGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCATCTATTA 3300 

TGACATCAACCAGATCATCACCACTGCCGTGATGACCCTCAGCAAACACTTCGACACCCA 6626 

illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TGACATCAACCAGATCATCACCACTGCCGTGATGACCCTCAGCAAACACTTCGACACCCA 3360 

TGGGCGGATCAAGGAGGTCCAGTATGAGATGTTCCGGTCCCTCATGTACTGGATGACGGT 6686 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TGGGCGGATCAAGGAGGTCCAGTATGAGATGTTCCGGTCCCTCATGTACTGGATGACGGT 3420 

GCAATATGACAGCATGGGCAGGGTGATCAAGAGGGAGCTAAAACTGGGGCCCTATGCCAA 674 6 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIII 
GCAATATGACAGCATGGGCAGGGTGATCAAGAGGGAGCTAAAACTGGGGCCCTATGCCAA 34 80 

TACCACGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGCCGTCAA 6806 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 
TACCACGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGCCGTCAA 354 0 

TGACCGCCCGACCTGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCC 6866 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
TGACCGCCCGACCTGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCC 3600 

AGGCAACAGTGTGCGCCTCATGCCCTTGCGCTATGACCTCCGGGATCGGATAACCAGACT 6926 

lllllllllllllllllllllllllllllllllllllllllllllllllilllllllMI 
AGGCAACAGTGTGCGCCTCATGCCCTTGCGCTATGACCTCCGGGATCGGATAACCAGACT 3660 

CGGGGATGTGCAGTACAAAATTGACGACGATGGCTATCTGTGCCAGAGAGGGTCTGACAT 6986 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIllllllllllll 
CGGGGATGTGCAGTACAAAATTGACGACGATGGCTATCTGTGCCAGAGAGGGTCTGACAT 3 720 

CTTCGAATACAATTCCAAGGGCCTCCTAACAAGAGCCTACAACAAGGCCAGCGGGTGGAG 704 6 

llllllllllilllllllllllllllllllllllllllllllMIIIIIIIIIIIIIMI 
CTTCGAATACAATTCCAAGGGCCTCCTAACAAGAGCCTACAACAAGGCCAGCGGGTGGAG 3780 

TGTCCAGTACCGCTATGATGGCGTAGGACGGCGGGCTTCCTACAAGACCAACCTGGGCCA 7106 

lllllllllilllllllllllMIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIII 

TGTCCAGTACCGCTATGATGGCGTAGGACGGCGGGCTTCCTACAAGACCAACCTGGGCCA 3840 

CCACCTGCAGTACTTCTACTCTGACCTCCACAACCCGACGCGCATCACCCATGTCTACAA 7166 

lllllllllllillllllllllllllllllllllllllllllllllllllllllllllll 
CCACCTGCAGTACTTCTACTCTGACCTCCACAACCCGACGCGCATCACCCATGTCTACAA 3 900 

TCACTCCAACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCACCTCTTTGCCAT 7226 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TCACTCCAACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCACCTCTTTGCCAT 3 960 

46 15966-697 



QUERY: 
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QUERY: 
SB JCT : 
QUERY: 
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QUERY: 
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QUERY: 
SB JCT : 
QUERY: 



7227 GGAGAGCA( 

llllllll 
3961 GGAGAGCAGi 



.C^^I'GGGG. 

Hlill 

.gcSgtgggq 



;AGGAGTACTATGTTGCCTCTGATAACAC^^^CTCCTCTGGC 7286 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 iWl 1 1 M 1 1 1 1 1 

TGGGGAGGAGTACTATGTTGCCTCTGATAACACAGGGACTCCTCTGGC 4020 



7287 TGTGTTCAGCATCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCCTATGGGGAGAT 7346 

lllillllllllilMlllliilllllllllllillllllllllllilllllllMIMI 

4 021 TGTGTTCAGCATCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCCTATGGGGAGAT 4080 
7347 TTATTATGACTCCAACCCCGACTTCCAGATGGTCATTGGCTTCCATGGGGGACTCTATGA 7406 

lllllllllllilllllllllllllilllllllllllllMIMIIIIIIIIIIIIIill 

4081 TTATTATGACTCCAACCCCGACTTCCAGATGGTCATTGGCTTCCATGGGGGACTCTATGA 4140 
7407 CCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTGGCAGGACGATG 7466 

IIIIIMIIIIillllllllllllllMIIIIIIIIIIMMMIIIIIIIillllMM 

4141 CCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTGGCAGGACGATG 4200 
7467 GACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCCCTTTAACCT 7526 

llllillllllllllllMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIiillllMM 

4201 GACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCCCTTTAACCT 4260 
752 7 GTATATGTTCAAGAGCAACAATCCTCTCAGCAGTGAGCTAGATTTGAAGAACTACGTGAC 7586 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilllllll 

4261 GTATATGTTCAAGAGCAACAATCCTCTCAGCAGTGAGCTAGATTTGAAGAACTACGTGAC 4320 
7587 AGATGTGAAAAGCTGGCTTGTGATGTTTGGATTTCAGCTTAGCAACATCATTCCTGGCTT 7646 

IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIillllllllllllll 

4321 AGATGTGAAAAGCTGGCTTGTGATGTTTGGATTTCAGCTTAGCAACATCATTCCTGGCTT 4380 
7647 CCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCCTATGAATTGTCAGAGAGTCAAGCAAG 7706 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

43 81 CCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCCTATGAATTGTCAGAGAGTCAAGCAAG 444 0 
7707 TGAGAATGGACAGCTCATTACAGGTGTCCAACAGACAACAGAGAGACATAACCAGGCCTT 7766 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIII 

4441 TGAGAATGGACAGCTCATTACAGGTGTCCAACAGACAACAGAGAGACATAACCAGGCCTT 4500 
7767 CATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATCCGAGAGAAAGC 7826 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

4501 CATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATCCGAGAGAAAGC 4560 
7827 AGGTCACTGGTTTGCCACCACCACGCCCATCATTGGCAAAGGCATCATGTTTGCCATCAA 7886 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

4 561 AGGTCACTGGTTTGCCACCACCACGCCCATCATTGGCAAAGGCATCATGTTTGCCATCAA 4620 
7887 AGAAGGGCGGGTGACCACGGGCGTGTCCAGCATCGCCAGCGAAGATAGCCGCAAGGTGGC 7946 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMI 

4621 AGAAGGGCGGGTGACCACGGGCGTGTCCAGCATCGCCAGCGAAGATAGCCGCAAGGTGGC 4680 
7947 ATCTGTGCTGAACAACGCCTACTACCTGGACAAGATGCACTACAGCATCGAGGGCAAGGA 8006 

IMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 

4 681 ATCTGTGCTGAACAACGCCTACTACCTGGACAAGATGCACTACAGCATCGAGGGCAAGGA 4 74 0 
8007 CACCCACTACTTTGTGAAGATTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCAC 8066 

IIIIIIIIIIIMIIIIIIillllllllllllllllllllllllllllllllllllllM 

4741 CACCCACTACTTTGTGAAGATTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCAC 4 800 
8067 CATCGGCCGCAAGGTGCTAGAGAGCGGGGTGAACGTGACCGTGTCCCAGCCCACGCTGCT 8126 

llllllilMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

4 801 CATCGGCCGCAAGGTGCTAGAGAGCGGGGTGAACGTGACCGTGTCCCAGCCCACGCTGCT 4860 
8127 GGTCAACGGCAGGACTCGAAGGTTCACGAACATTGAGTTCCAGTACTCCACGCTGCTGCT 8186 

IIIIIIIIIIIIIIMIIIIIIIIilllllMlllllillllMIIIIIIIMIIIMII 

4 861 GGTCAACGGCAGGACTCGAAGGTTCACGAACATTGAGTTCCAGTACTCCACGCTGCTGCT 4 920 
8187 CAGCATCCGCTATGGCCTCACCCCCGACACCCTGGACGAAGAGAAGGCCCGCGTCCTGGA 8246 

IMIIIIIIMIIMIIilllllMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII 

4 921 CAGCATCCGCTATGGCCTCACCCCCGACACCCTGGACGAAGAGAAGGCCCGCGTCCTGGA 4 980 

824 7 CCAGGCGAGACAGAGGGCCCTGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGA 8306 
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SBJCT 

QUERY: 8307 

SBJCT: 5041 

QUERY: 8367 

SBJCT: 5101 

QUERY: 84 27 

SBJCT: 5161 



mill 

4981 CCAGGCGA( 



IliJI 

W 



llllllilllllllllllllllllllll 
fAGGGCCCTGGGCACGGCCTGGGCCAAGGAG^ 



jlLLUIil 
CA^B^\AA( 



llllllll 

.GCCAGGGA 5040 



CGGGAGAGAGGGGAGCCGCCTGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGG 8366 

llllllllllllllllllllllllllllllllllllllllllllllllllllliilllll 
CGGGAGAGAGGGGAGCCGCCTGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGG 5100 

GCGCGTGCAAGGGTACGAGGGATATTACGTGCTTCCCGTGGAGCAATACCCAGAGCTTGC 8426 

llllilllllllllllllllllllllMllllllllillMIIIIIIIIIIIIIIIIIII 

GCGCGTGCAAGGGTACGAGGGATATTACGTGCTTCCCGTGGAGCAATACCCAGAGCTTGC 5160 

AGACAGTAGCAGCAACATCCAGTTTTTAAGACAGAATGAGATGGGAAAGAGGTAACAAAA 84 86 

lillMllllllllllllllllllllllllllllllllllllllllllllllilllllll 
AGACAGTAGCAGCAACATCCAGTTTTTAAGACAGAATGAGATGGGAAAGAGGTAACAAAA 5220 



QUERY: 84 87 TAATCTGCTGCCATTCCTTGTCTGAATGGCTCAGCAGGAGTAACTGTTATCTCCTCTCCT 8546 

IIIIIIIMIIMIIIIillllllllllllllllllllllllllllllllllllllllll 
SBJCT: 5221 TAATCTGCTGCCATTCCTTGTCTGAATGGCTCAGCAGGAGTAACTGTTATCTCCTCTCCT 5280 

QUERY: 854 7 AAGGAGATGAAGACCTAACAGGGGCACTGCGGCTGGGCTGCTTTAGGAGACCAAGTGGCA 8606 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
SBJCT: 52 81 AAGGAGATGAAGACCTAACAGGGGCACTGCGGCTGGGCTGCTTTAGGAGACCAAGTGGCA 534 0 

QUERY: 8607 AGAAAGCTCACATTTTTTGAGTTCAAATGCTACTGTCCAAGCGAGAAGTCCCTCATCCTG 8666 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
SBJCT: 5341 AGAAAGCTCACATTTTTTGAGTTCAAATGCTACTGTCCAAGCGAGAAGTCCCTCATCCTG 5400 

QUERY: 8667 AAGTAGACTAAAGCCCGGCTGAAAATTCCGAGGAAAACAAAACAAACGAATGAATGAACA 8726 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
SBJCT: 5401 AAGTAGACTAAAGCCCGGCTGAAAATTCCGAGGAAAACAAAACAAACGAATGAATGAACA 5460 

QUERY: 8727 GACACACACAATGTTCCAAGTTCCCCTAAAATATGACCCACTTGTTCTGGGTCTACGCAG 8786 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
SBJCT: 5461 GACACACACAATGTTCCAAGTTCCCCTAAAATATGACCCACTTGTTCTGGGTCTACGCAG 5520 

QUERY: 8787 AAAAGAGACGCAAAGTGT 8804 

Illlllllllllllllll 
SBJCT: 5521 AAAAGAGACGCAAAGTGT 5538 

SCORE = 1362 BITS (687), EXPECT = 0.0 
IDENTITIES = 705/714 (98%) 
STRAND = PLUS / PLUS 

QUERY: 8875 CACGGACCGATAAACAAAGAAGCGAAGATAAGAAAGAAGGCCTCATATCCAATTACCTCA 8 934 

llllllllllllllllllllllllllllllllllllllllllllilllllllllllllll 
SBJCT: 5609 CACGGACCGATAAACAAAGAAGCGAAGATAAGAAAGAAGGCCTCATATCCAATTACCTCA 5668 

QUERY: 8935 CTCATTCACATGTGAGCGACACGCAGACATCCGCGAGGGCCAGCGTCACCAGACCAGCTG 8 994 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
SBJCT: 5669 CTCATTCACATGTGAGCGACACGCAGACATCCGCGAGGGCCAGCGTCACCAGACCAGCTG 5728 

QUERY: 8995 CGGGACAAACCACTCAGACTGCTTGTAGGACAAATACTTCTGACATTTTCGTTTAAGCAA 9054 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
SBJCT: 5729 CGGGACAAACCACTCAGACTGCTTGTAGGACAAATACTTCTGACATTTTCGTTTAAGCAA 5788 

QUERY: 9055 ATACAGGTGCATTTAAAACACGACTTTGGGGGTGATTTGTGTGTAGCGCCTGGGGAGGGG 9114 

llllllllllilllllllllllilllllllllllllllllllllllllllllllllllll 
SBJCT: 5789 ATACAGGTGCATTTAAAACACGACTTTGGGGGTGATTTGTGTGTAGCGCCTGGGGAGGGG 5848 



QUERY: 9115 GGATAAAAGAGGAGGAGTGAGCACTGGAAATACTTTTTAAAGNNNNNNNNNCATGAGGGA 9174 

Milllllllllllllllllllllllllllllllllllllll llllllll! 
SBJCT: 5849 GGATAAAAGAGGAGGAGTGAGCACTGGAAATACTTTTTAAAGAAAAAAAAACATGAGGGA 5908 

QUERY: 9175 ATAAAAGAAATTCCTATCAAAAATCAAAGTGAAATAATACCATCCAGCACTTAACTCTCA 9234 

lllllllilllllllllllllllllllllllllllllllilMIIIIIIIMIIIIIIII 

SBJCT: 5909 ATAAAAGAAATTCCTATCAAAAATCAAAGTGAAATAATACCATCCAGCACTTAACTCTCA 5968 

QUERY: 9235 GGTCCCAACTAAGTCTGGCCTGAGCTAATTTATTTGAGCGCAGAGTGTAAAATTTAATTC 9294 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
SBJCT: 5969 GGTCCCAACTAAGTCTGGCCTGAGCTAATTTATTTGAGCGCAGAGTGTAAAATTTAATTC 6028 
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QUERY: 92 95 



SBJCT: 6029 



AAAATGGTi 

llllllll 
AAAATGGTGGi 



"GGCTA' 



TAATCACTACAGATAAATTTCATACTCTTTTG^^rGGAGATTCCA 93 54 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiWiiiiiiiiiM 

ATAATCACTACAGATAAATTTCATACTCTTTTGTCTTTGGAGATTCCA 6088 



QUERY: 9355 TTGTGGACAGTAATACGCAGTTACAGGGTGTAGTCTGTTTAGATTCCGTAGTTCGTGGGT 9414 

lllllillllllllMIIIIIIIIMIIIIIIIIIIMIIIMIIIIIIIIIilllllll 

SBJCT : 6089 TTGTGGACAGTAATACGCAGTTACAGGGTGTAGTCTGTTTAGATTCCGTAGTTCGTGGGT 6148 
QUERY: 9415 ATCAGTTTCGGTAGAGGTGCAGCATCGTGACACTTTTGCTAACAGGTACCACTTCTGATC 9474 

MIIMMIIIIIMIMIIMIIIIIMIIIIillllMIIIMIIIIIIIIIIIIMI 

SBJCT: 614 9 ATCAGTTTCGGTAGAGGTGCAGCATCGTGACACTTTTGCTAACAGGTACCACTTCTGATC 6208 
QUERY: 94 75 ACCCTGTACATACATGAGCCGAAAGGCACAATCACTGTTTCAGATTTAAAATTATTAGTG 9534 

lllllllilllllMIIIMIMIIIIIMIIIIIIIIIIIIIIIillilllllllllM 

SBJCT: 6209 ACCCTGTACATACATGAGCCGAAAGGCACAATCACTGTTTCAGATTTAAAATTATTAGTG 6268 
QUERY: 9535 TGTTTGTTTGGTCCAGAT^CTGAGACAATCACATGACAGTCACCACGAGGAGAG 9588 

IIIMIIIIIIIIIIMIIIIIMilllMIIIIIMIIIIIIIIIIIIIIIII 

SBJCT: 6269 TGTTTGTTTGGTCCAGAAACTGAGACAATCACATGACAGTCACCACGAGGAGAG 6322 

SCORE = 349 BITS (176), EXPECT = 2E-92 
IDENTITIES = 176/176 (100%) 
STRAND = PLUS / PLUS 

GTCTAATAAGAACTTTGGTACAGGAACTTTTTTGTAATATACATGTATGAATTGTTCATC 9710 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMilllllllllilllilillllllll 

GTCTAATAAGAACTTTGGTACAGGAACTTTTTTGTAATATACATGTATGAATTGTTCATC 6444 

GAGTTTTTATATTAATTTTAATTTGCTGCTAAGCAAAGACTAGGGACAGGCAAAGATAAT 9770 

lllllllillllllllllllllllllllllllllllllllllllllllllllllllllll 
GAGTTTTTATATTAATTTTAATTTGCTGCTAAGCAAAGACTAGGGACAGGCAAAGATAAT 6504 

TTATGGCAAAGTGTTTAAATTGTTTATACATAAATAAAGTCTCTAAAACTCCTGTG 9826 

IIMIIIIIIIIIIIilillllllllllllllllllllililllllllllllllll 

TTATGGCAAAGTGTTTAAATTGTTTATACATAAATAAAGTCTCTAAAACTCCTGTG 6560 



QUERY: 


9651 


SBJCT : 


6385 


QUERY : 


9711 


SBJCT : 


6445 


QUERY: 


9771 


SBJCT : 


6505 



In this search it was also found that the FCTRSbcd and e nucleic acids had homology to 
five fragments of Mus musculus mRNA for Ten-m2. It has 5498 of 6108 bases (90%) identical to 
bases 2504-8610, 1095 of 1 196 bases (91%) identical to bases 103-1298, 1000 of 1088 bases 
(91%) identical to bases 1420-2540, 81 of 89 bases (91%) identical to bases 8655-8743, and 30 
of 32 bases (93%) identical to bases 7-38 of Mus musculus mRNA for Ten-m2 (Table 3M). 

Table 3M. BLASTN of FCTR3b, c, d, and e against Mus musculus mRNA for Ten-m2 

Mrna (SEQ ID NO:65) 

> GI|4 760777|DBJ|AB025411.1 IAB025411 MUS MUSCULUS MRNA FOR TEN-M2, COMPLETE CDS 
LENGTH = 8797 

SCORE = 7263 BITS (3664), EXPECT = 0.0 
IDENTITIES = 5498/6108 (90%), GAPS = 1/6108 (0%) 
STRAND = PLUS / PLUS 



QUERY : 


2578 


SBJCT : 


2504 


QUERY : 


2638 


SBJCT: 


2564 


QUERY: 


2698 



IIIMIIIII 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMII 



lllllillllllllllllllllllllllllll lllllllllilllllllllllM I 



11 Mllllllllllllllllllllllllllllllllllll I 
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SBJCT: 


2624 


QUERY: 


2758 


SBJCT : 


2684 


QUERY: 


2818 


SBJCT : 


2744 


QUERY : 


2878 


SBJCT : 


2804 


QUERY: 


2938 


SBJCT : 


2864 


QUERY: 


2998 


SBJCT : 


2924 


QUERY: 


3058 


SBJCT : 


2984 


QUERY: 


3118 


SBJCT : 


3044 


QUERY: 


3178 


SBJCT : 


3104 


QUERY : 


3238 


SBJCT : 


3164 


QUERY: 


3298 


SBJCT : 


3224 


QUERY : 


3358 


SBJCT : 


3284 


QUERY: 


3418 


SBJCT : 


3344 


QUERY: 


3478 


SBJCT : 


3404 


QUERY: 


3538 


SBJCT : 


3464 


QUERY : 


3598 


SBJCT : 


3524 


QUERY: 


3658 


SBJCT : 


3584 


QUERY: 


3718 


SBJCT : 


3644 



TGCGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGACTGC 



TGCCTGCAG^HIcCTGTCAGAACAGCCTGCTCTGCCGGGGGTCCa^PSACCCACTGGAC 2817 

IIIII llimilllllllllllllilllllllllllllilll IIIIIIII IIIII 

TGCCTACAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGTCTCGGGACCCCTTGGAC 2743 

ATCATTCAGCAGGGCCAGACGGATTGGCCCGCAGTGAAGTCCTTCTATGACCGTATCAAG 2877 

IIIIIIIIIII II IIIII II IIIII lllllllllllllllllllllil Illlll 

ATCATTCAGCAAGGTCAGACAGACTGGCCTGCAGTGAAGTCCTTCTATGACCGCATCAAG 2803 

CTCTTGGCAGGCAAGGATAGCACCCACATCATTCCTGGAGAGAACCCTTTCAACAGCAGC 2937 

Illllllllllllllll IIIMIIIIIIIIIIIIIIIIII IIIII IIIII Illlll 

CTCTTGGCAGGCAAGGACAGCACCCACATCATTCCTGGAGACAACCCCTTCAATAGCAGC 2863 



:C^^A( 



ACCCTGACTGC 2683 



TTGGTTTCTCTCATCCGAGGCCAAGTAGTAACTACAGATGGAACTCCCCTGGTCGGTGTG 

MM IIIII IIIIIIMIIIIMIIIIII I IIIII Illlll MM MMM 

CTGGTGTCTCTGATCCGAGGCCAAGTAGTAACCATGGATGGGACTCCCTTGGTGGGTGTG 



2997 



2923 



AACGTGTCTTTTGTCAAGTACCCAAAATACGGCTACACCATCACCCGCCAGGATGGCACG 3057 

II IMIIIIIIIIIIIIIIIIIIIIIII llllllllllllll lllllllllllllll 

AATGTGTCTTTTGTCAAGTACCCAAAATATGGCTACACCATCACTCGCCAGGATGGCACG 2983 

TTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAGCGAGCCCCGTTC 3117 

II IIIIIIII II IIIII III II Illlll I III Illllllllllllllll III 

TTTGACCTGATTGCCAATGGGGGTTCTGCCTTGACTCTTCACTTTGAGCGAGCCCCTTTC 3043 
ATGAGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCCATGGACACCCTG 3177 

IIIIMIIIIIIIIIII IIIIIIIIIII IIIIIIIIIII II IMIIIIIMIMII 

ATGAGCCAGGAGCGCACAGTGTGGCTGCCATGGAACAGCTTCTATGCCATGGACACCCTG 3103 

GTGATGAAGACCGAGGAGAACTCCATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCT 3237 

II llllllllllllll lllllllllllllllllllllllllllllllllllllllll 
GTAATGAAGACCGAGGAAAACTCCATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCA 3163 

GATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTGCTGCCCCTGGGCAGAAT 3297 

MMMMMMMMMM II IMIIIIIIIIMI II III Illlll III 

GATCCAATCATCATCTCCTCTCCTCTGTCCACCTTCTTCAGCGCTTCCCCTGCCTCGAAC 3223 

CCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCTGGTTCCAATGTG 3357 

IIIII llllllllllllllllllllllllllllllll llllllllllll IIIIIIII 
CCCATTGTGCCTGAGACCCAGGTTCTTCATGAAGAAATTGAGCTCCCTGGTACCAATGTG 3283 

AAACTTCGCTATCTGAGCTCTAGAACTGCAGGGTACAAGTCACTGCTGAAGATCACCATG 3417 

II II 11 IIIII llllllllllllllllllll IIIII llllllllllllllllll 
AAGCTCCGTTATCTCAGCTCTAGAACTGCAGGGTATAAGTCGCTGCTGAAGATCACCATG 3343 

ACCCAGTCCACAGTGCCCCTGAACCTCATTAGGGTTCACCTGATGGTGGCTGTCGAGGGG 3477 

II lllllllllllllll llllllllll IIIIIIIM lllllll IIIII Illlll 
ACGCAGTCCACAGTGCCCTTGAACCTCATCAGGGTTCACTTGATGGTTGCTGTAGAGGGG 3403 

CATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCTCCACCTTCATCTGGGAC 353 7 

IIIMillllllllllllllllllllllllllllllM MM III IIIIMIIIMI 

CATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTAGCCTACACATTCATCTGGGAC 3463 

AAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGTGTCTGTCGGG 3597 

IIIIIIIIIII llllllllllllll IIIII II II Illllllllllllllll III 
AAGACAGATGCTTATGGCCAAAGGGTTTATGGCCTATCGGATGCTGTTGTGTCTGTTGGG 3523 

TTTGAATATGAGACCTGTCCCAGTCTAATTCTCTGGGAGAAAAGGACAGCCCTCCTTCAG 3657 

IIIIIIIMIIIIIIII IIIIIIII II II IMIIIIIIIIIIIIIIIII Illlll 
TTTGAATATGAGACCTGCCCCAGTCTCATCCTGTGGGAGAAAAGGACAGCCCTGCTTCAG 3583 

GGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCCTAGACAAACACCACATCCTC 3717 

IIIIIIIMIIIIIIII IIIIIMI II IIIIIIIIIII IIIIIIIIIIMI MM 

GGATTCGAGCTGGACCCTTCCAACCTTGGAGGCTGGTCCCTGGACAAACACCACACCCTC 3643 

AATGTTAAAAGTGGAATCCTACACAAAGGCACTGGGGAAAACCAGTTCCTGACCCAGCAG 3777 

IIIII IIIII IIIII IIIIIIIIIII II IIIII llllllllllllllllillll 
AATGTGAAAAGCGGAATACTACACAAAGGGACAGGGGAGAACCAGTTCCTGACCCAGCAG 3703 
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QUERY: 3778 CCTGCCATCATCACCAGCATCATGGGCAATGGTCGCCGCCGGAG 

llllllllii^l IlilMilllllM IIMIMII I II 

SBJCT: 3704 CCTGCCATd^BLCGAGCATCATGGGCAACGGTCGCCGCAGAAG 



CCTGTCCCAGC 3837 

IIMIIillll 
CCTGTCCCAGC 3763 



3897 



QUERY : 3 83 8 TGCAACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATC 

Mill lllllllllillllllllll III I Mill Mill IIIIIIM II III 

SBJCT: 3764 TGCAATGGCCTTGCTGAAGGCAACAAACTGTTAGCCCCTGTGGCCCTGGCTGTGGGGATC 3823 



QUERY: 3898 GATGGGAGCCTCTATGTGGGTGACTTCAATTACATCCGACGCATCTTTCCCTCTCGAAAT 3957 

IIMIIIIIIIII III lllllllllll M Mill IIIIIIIIIIIIIIIMIMI 

SBJCT: 3824 GATGGGAGCCTCTTTGTTGGTGACTTCAACTATATCCGGCGCATCTTTCCCTCTCGAAAT 3883 



QUERY: 3 958 GTGACCAGCATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAACCCAGCACAC 4017 

llllllll IIIIIIIIIIIIIIIMIIIIMIIIMMIMIIIIIII Mill III! 

SBJCT: 3884 GTGACCAGTATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAGCCCAGGACAC 3943 
QUERY: 4 018 AAGTACTACTTGGCAGTGGACCCCGTGTCCGGCTCGCTCTACGTGTCCGACACCAACAGC 4077 

llllllllllllll llllllllllll I Mill MIMIII II lllllllllli 

SBJCT: 3944 AAGTACTACTTGGCTGTGGACCCCGTGACTGGCTCACTCTACGTCTCTGACACCAACAGT 4003 
QUERY: 4078 AGGAGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAGACCTGGCTGGGAATTCGGAA 413 7 

I IIIIIIIMI llllllllllllll III MMMMMMMM lllllllll 

SBJCT: 4004 CGCCGAATCTACCGAGTCAAGTCTCTGAGCGGAGCCAAAGACCTGGCTGGAAATTCGGAA 4 063 

QUERY: 4138 GTTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATGAAGCCCGCTGCGGGGATGGA 4197 

llllllllllllll II II II IIIIIIMIIIIIIIIIIIIIIIIII lllllllll 
SBJCT: 4064 GTTGTGGCAGGGACTGGCGAACAATGTCTACCCTTTGATGAAGCCCGCTGTGGGGATGGA 4123 

QUERY: 4198 GGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTATTGCAGTAGACAAGAATGGG 4257 

llllllll I II II llllllllllllll IIMIIIIIIIIIIIIIIIIIIIIIII 
SBJCT: 4124 GGGAAGGCTGTGGACGCCACCCTGATGAGCCCCAGAGGTATTGCAGTAGACAAGAATGGG 4183 

QUERY: 4258 CTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCC 4317 

II IIMIIIIIM IIIIMIIIIIIIIIIIIIIIII Mill II IIIMIIIIIII 

SBJCT: 4184 CTTATGTACTTTGTTGATGCCACCATGATCCGGAAGGTGGACCAAAACGGAATCATCTCC 4243 

QUERY: 4318 ACCCTGCTGGGCTCCAATGACCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATG 4377 

llllllllllllllllllllllllll 11 Mill II lllllllllll II llllll 
SBJCT: 4244 ACCCTGCTGGGCTCCAATGACCTCACAGCTGTCCGACCACTGAGCTGTGACTCGAGCATG 4303 

QUERY: 43 78 GATGTAGCCCAGGTTCGTCTGGAGTGGCCAACAGACCTTGCTGTCAATCCCATGGATAAC 4437 

II II llllllll Mill II Mill llllllll II lllll llllllll III 
SBJCT: 4304 GACGTGGCCCAGGTCCGTCTAGAATGGCCGACAGACCTCGCCGTCAACCCCATGGACAAC 4363 

QUERY: 443 8 TCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATCACCGAGAACCACCAAGTCAGC 4497 

III MM lllll llllllll llllllll II lllll lllllllllll llllll 
SBJCT: 43 64 TCCCTGTACGTTCTGGAGAACAACGTCATCCTGCGGATCACGGAGAACCACCAGGTCAGC 4423 

QUERY: 44 98 ATCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTGACTACTCACTCAGCAAA 4 557 

lllll llllllll II lllllllllll lllll lllll llllllll IMIIIIII 

SBJCT: 4424 ATCATCGCGGGACGGCCTATGCACTGCCAGGTTCCCGGCATCGACTACTCGCTCAGCAAA 4483 

QUERY: 4558 CTAGCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTC 4617 

II lllll llllllll lllll MIMIII lllllillllllllllllllllllll 
SBJCT: 44 84 CTCGCCATCCACTCTGCGCTGGAATCAGCCAGCGCCATTGCCATTTCTCACACTGGGGTG 4543 

QUERY: 4 618 CTCTACATCACTGAGACAGATGAGAAGAAGATTAACCGTCTACGCCAGGTAACAACCAAC 4677 

lllllllllllllllll II llllillllll Mill llllllli II II lllll 
SBJCT: 4 544 CTCTACATCACTGAGACGGACGAGAAGAAGATCAACCGCCTACGCCAAGTCACCACCAAT 4 603 

QUERY: 4678 GGGGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCGACTGCAAT^CGATGTCAAT 4 737 

II IIIMIIIIII lllll lllll lllll lllll llllllllllllllllllll 
SBJCT: 4604 GGAGAGATCTGCCTCTTAGCCGGGGCGGCCTCAGACTGTGACTGCAAAAACGATGTCAAC 4663 

QUERY: 4738 TGCAACTGCTATTCAGGAGATGATGCCTACGCGACTGATGCCATCTTGAATTCCCCATCA 4797 

MM llllll II IIIIMM II lllll II 11 llllll MM II II II 

SBJCT: 4 664 TGCATCTGCTACTCGGGAGATGACGCTTACGCCACGGACGCCATCCTGAACTCGCCGTCC 4 723 

QUERY: 4 798 TCCTTAGCTGTAGCTCCAGATGGTACCATTTACATTGCAGACCTTGGAAATATTCGGATC 4857 
llllllll II lllll lllll lllll illllllllllllllll lllll llllll 
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SB JCT : 
QUERY: 
SBJCT: 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY : 
SBJCT: 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 



4 724 TCCTTAGCCGTGGCTCCGGATGGCACCATCTACATTGCAGACCTTi 



;19iA< 



TATCCGGATC 4 783 



4 858 AGGGCGGTC!^«SlAGAACAAGCCTGTTCTTAATGCCTTCAACCAG'3^EAGGCTGCATCC 4 917 

IIIIIIMIlTlll II II II llllllll II lllllllllllllllllllllll 
4784 AGGGCGGTCAGCAAAAATAAACCCGTTCTTAACGCATTCAACCAGTATGAGGCTGCATCT 4843 

4 918 CCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCATCCACCAATACACTGTGAGC 4 977 

II Mill Mill II II II IIMIIIIIIIIII Mil! 11 lllllllllll 

4844 CCGGGAGAACAGGAATTGTACGTGTTCAACGCTGATGGTATCCATCAGTACACTGTGAGT 4 903 
4 978 CTGGTGACAGGGGAGTACTTGTACAATTTCACATATAGTACTGACAATGATGTCACTGAA 5037 

llllllll IIMIIIIIIIIIIIIIIIIIIIIII II llllllllll Mill II 

4 904 CTGGTGACTGGGGAGTACTTGTACAATTTCACATACAGCGCTGACAATGACGTCACCGAG 4963 
503 8 TTGATTGACAATAATGGGAATTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGT 50 97 

IMIIIIIIM II IIMIIIIIII llllllll IIIIIIIIIMIMIIIIIIIII 

4 964 TTGATTGACAACAACGGGAATTCCCTAAAGATCCGCCGGGACAGCAGTGGCATGCCCCGC 5023 
5098 CACCTGCTCATGCCTGACAACCAGATCATCACCCTCACCGTGGGCACCAATGGAGGCCTC 5157 

IIIMIIIIIIIII II II IIIM llllllll II llllllilMMIIIIIIIII 

5024 CACCTGCTCATGCCGGATAATCAGATTATCACCCTTACTGTGGGCACCAATGGAGGCCTC 5083 
5158 AAAGTCGTGTCCACACAGAACCTGGAGCTTGGTCTCATGACCTATGATGGCAACACTGGG 5217 

MM lllllllll IIIIIIIIIMIII M IIIIIIM llllllll IIIIMII 

5084 AAAGCCGTGTCCACTCAGAACCTGGAGCTGGGCCTCATGACTTATGATGGGAACACTGGA 5143 
5218 CTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTTTCTATGACTATGACCACGAA 5277 

Mill lllllllllll llllllll llllllll Mill lllllllllllllllll 

5144 CTCCTAGCCACCAAGAGTGATGAAACCGGATGGACAACTTTTTATGACTATGACCACGAG 52 03 
5278 GGCCGCCTGACCAACGTGACGCGCCCCACGGGGGTGGTAACCAGTCTGCACCGGGAAATG 5337 

Mill llllllll Mill lllllllllll Mill lllllllllllllllllllll 

5204 GGCCGTCTGACCAATGTGACCCGCCCCACGGGCGTGGTGACCAGTCTGCACCGGGAAATG 5263 
53 3 8 GAGAAATCTATTACCATTGACATTGAGAACTCCAACCGTGATGATGACGTCACTGTCATC 53 97 

lllllllllll llllllllllllllllllllllllll lllllllllllllllll III 

5264 GAGAAATCTATCACCATTGACATTGAGAACTCCAACCGGGATGATGACGTCACTGTGATC 5323 
53 98 ACCAACCTCTCTTCAGTAGAGGCCTCCTACACAGTGGTACAAGATCAAGTTCGGAACAGC 54 57 

lllllllllll II II lllllllllll llllllllllllllllllll II llllll 

53 24 ACCAACCTCTCCTCCGTGGAGGCCTCCTATACAGTGGTACAAGATCAAGTGCGAAACAGC 53 83 

54 58 TACCAGCTCTGTAATAATGGTACCCTGAGGGTGATGTATGCTAATGGGATGGGTATCAGC 5517 

lllllllllll llllllll llllll llllllllll II II II MM I Mill 

53 84 TACCAGCTCTGCAATAATGGAACCCTGCGGGTGATGTACGCCAACGGCATGGCTGTCAGC 5443 
5518 TTCCACAGCGAGCCCCATGTCCTAGCGGGCACCATCACCCCCACCATTGGACGCTGCAAC 5577 

IMIIMI llllllll Mill II llllllllllllllllllll II llllllill 

5444 TTCCACAGTGAGCCCCACGTCCTCGCAGGCACCATCACCCCCACCATCGGGCGCTGCAAC 5503 
5578 ATCTCCCTGCCTATGGAGAATGGCTTAAACTCCATTGAGTGGCGCCTAAGAAAGGAACAG 5637 

Mill Mill IIMIIIIMII I IIIIMII lllllllllll II Mlllllll 

5504 ATCTCTCTGCCCATGGAGAATGGCCTGAACTCCATCGAGTGGCGCCTGAGGAAGGAACAG 5563 
5638 ATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCCGGGTCCATGGAAGAAATCTCTTG 5697 

II lllllllllllllllllllllll llllllll llllllll Mill llllll II 

5564 ATCAAAGGCAAAGTCACCATCTTTGGGAGGAAGCTTCGGGTCCACGGAAGGAATCTCCTG 5623 



5698 TCCATTGACTATGATCGAT^TATTCGGACTGAAAAGATCTATGATGACCACCGGAAGTTC 

llllllll Mill llllllll II II II IMIIMI llllllllllllll III 

5624 TCCATTGATTATGACCGAAATATCCGTACGGAGAAGATCTACGATGACCACCGGAAATTC 



5757 



5683 



5817 



5758 ACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCCTCTGGCTGCCCAGCAGCGGG 

llllllllllllll IIIIIIIMIIIIIIIIIIIIIMII Mill II Mill III 

5684 ACCCTGAGGATCATCTATGACCAGGTGGGCCGCCCCTTCCTGTGGCTCCCGAGCAGTGGG 5743 
5818 CTGGCAGCTGTCAACGTGTCATACTTCTTCAATGGGCGCCTGGCTGGGCTTCAGCGTGGG 5877 

llllllll Mill II II IIIIIIIIIIIIIIIMI MM II II Mill III 

5744 CTGGCAGCCGTCAATGTCTCCTACTTCTTCAATGGGCGCTTGGCCGGCCTCCAGCGAGGG 5803 
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QUERY: 



SB JCT : 



5878 GCCATGAGCGAGAGGACAGACATCGACAAGCAAGGCCGCATCXSTG^ 

allllMMM IIMIIIIIIIIII mill 
GGACAGACATTGACAAGCAAGGCCGGATCGTG' 



CGCATGTTCGCT 5937 

llllllllll 

JGCATGTTCGCC 5863 



5997 



QUERY : 593 8 GACGGGAAAGTGTGGAGCTACTCCTACCTTGACAAGTCCATGGTCCTCCTGCTTCAGAGC 

llllllllill Mill II Mill IIIIMMIIIMIIMIM Mill llllll 

SB JCT: 5864 GACGGGAAAGTCTGGAGTTATTCCTATCTTGACAAGTCCATGGTCCTTCTGCTACAGAGC 5923 
QUERY: 5998 CAACGTCAGTATATATTTGAGTATGACTCCTCTGACCGCCTCCTTGCCGTCACCATGCCC 6057 

lllllllllll IIMMM IIIIIIIMII II lllllll II Mill IIMII 

SB JCT: 5924 CAACGTCAGTACATATTTGAATATGACTCCTCCGATCGCCTCCACGCAGTCACTATGCCC 5983 
QUERY: 6058 AGCGTGGCCCGGCACAGCATGTCCACACACACCTCCATCGGCTACATCCGTAATATTTAC 6117 

II II Mllllllllllllllllll MMIIIMII II MIIIIM II llllll 

SB JCT: 5984 AGTGTCGCCCGGCACAGCATGTCCACGCACACCTCCATTGGTTACATCCGAAACATTTAC 6043 
QUERY: 6118 AACCCGCCTGAAAGCAATGCTTCGGTCATCTTTGACTACAGTGATGACGGCCGCATCCTG 6177 

Mill II lllllllllll IIIIMMIIIIIMIIIIIMMIIIIIIIIIIIIII 

SB JCT: 6044 AACCCACCCGAAAGCAATGCATCGGTCATCTTTGACTACAGTGATGACGGCCGCATCCTA 6103 
QUERY: 6178 AAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTACAAGTATGGGAAACTCTCCAAG 6237 

Mill II II IIMMM II IIIIMIIIIIIIIIIIIIMII llllllllllll 

SB JCT: 6104 AAGACATCTTTCTTGGGCACTGGGCGCCAGGTGTTCTACAAGTATGGAAAACTCTCCAAG 6163 
QUERY: 6238 TTATCAGAGATTGTCTACGACAGTACCGCCGTCACCTTCGGGTATGACGAGACCACTGGT 6297 

lllllllllll lllllllllll M IMIIIIIIM IIIIIIIIIIIIIMII III 

SBJCT: 6164 TTATCAGAGATAGTCTACGACAGCACAGCCGTCACCTTTGGGTATGACGAGACCACCGGT 6223 

QUERY: 62 98 GTCTTGAAGATGGTCAACCTCCAAAGTGGGGGCTTCTCCTGCACCATCAGGTACCGGAAG 63 57 

III lllllllllllll lllllllllllllllllllllll IIIIIIIIIIMII III 
SBJCT: 6224 GTCCTGAAGATGGTCAATCTCCAAAGTGGGGGCTTCTCCTGTACCATCAGGTACCGAAAG 6283 

QUERY: 63 58 ATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCCGAGGAAGGCATGGTCAATGCC 6417 

MM Mill llllllllllllll lllllllllll MIIIIM III MM III 

SBJCT: 6284 GTTGGGCCCCTTGTGGACAAGCAGATTTACAGGTTCTCTGAGGAAGGAATGATCAACGCC 6343 
QUERY: 6418 AGGTTTGACTACACCTATCATGACAACAGCTTCCGCATCGCAAGCATCAAGCCCGTCATA 6477 

MIIIIM II MIIIIM Mill lllllllllll II MIIIIM llllllll 

SBJCT: 6344 AGGTTTGATTATACCTATCACGACAATAGCTTCCGCATTGCCAGCATCAAACCCGTCATT 6403 

QUERY: 64 78 AGTGAGACTCCCCTCCCCGTTGACCTCTACCGCTATGATGAGATTTCTGGCAAGGTGGAA 653 7 

II IIIIIMIIII II IIIIIIIIIIIIIIIIIIM llllllll llllllllllll 
SBJCT: 6404 AGCGAGACTCCCCTTCCTGTTGACCTCTACCGCTATGACGAGATTTCCGGCAAGGTGGAA 6463 

QUERY: 653 8 CACTTTGGTAAGTTTGGAGTCATCTATTATGACATCAACCAGATCATCACCACTGCCGTG 6597 

Mill II llllllll llllllll II lllllllllllllllllllllllllllll 
SBJCT: 6464 CACTTCGGCAAGTTTGGGGTCATCTACTACGACATCAACCAGATCATCACCACTGCCGTC 6523 

QUERY: 6598 ATGACCCTCAGCAAACACTTCGACACCCATGGGCGGATCAAGGAGGTCCAGTATGAGATG 6657 

Mill II Mill Mill lllllllllllill llllllll II II lllllllll 

SBJCT: 6524 ATGACGCTTAGCAAGCACTTTGACACCCATGGGCGCATCAAGGAAGTGCAATATGAGATG 6583 

QUERY: 6658 TTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACAGCATGGGCAGGGTGATCAAG 6717 

llllllllllllllllllllllllll lllllillllllll lllil Mill llllll 
SBJCT: 6584 TTCCGGTCCCTCATGTACTGGATGACTGTGCAATATGACAGTATGGGTAGGGTCATCAAG 6643 

QUERY: 6718 AGGGAGCTAAAACTGGGGCCCTATGCCAATACCACGAAGTACACCTATGACTACGATGGG 6777 

Mill II Mill IIMMMMMM IIIM lllllllllllllllll II III 

SBJCT: 6644 AGGGAACTGAAACTAGGGCCCTATGCCAACACCACAAAGTACACCTATGACTATGACGGG 6703 

QUERY: 6778 GACGGGCAGCTCCAGAGCGTGGCCGTCAATGACCGCCCGACCTGGCGCTACAGCTATGAC 6837 

Mill lllllllllll lllllllllllllllll II lllllllllll lllllllll 
SBJCT: 6704 GACGGCCAGCTCCAGAGTGTGGCCGTCAATGACCGGCCTACCTGGCGCTATAGCTATGAC 6763 

QUERY: 6838 CTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTGTGCGCCTCATGCCCTTGCGC 6897 

II IMIIIII II III I II llllllll lllllll llllllllllllll Mi 

SBJCT: 6764 CTCAATGGGAACCTGCACCTTCTAAACCCAGGAAACAGTGCTCGCCTCATGCCCTTACGC 6823 

QUERY: 6898 TATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGAT 6957 
lllllllllll II lllllllllll II Mill llllllllllllll II llllll 
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SB JCT : 


6824 


QUERY: 


6958 


SB JCT : 


6884 


QUERY: 


7018 


SB JCT : 


6944 


QUERY: 


7078 


SB JCT : 


7004 


QUERY: 


7138 


SB JCT : 


7064 


QUERY : 


7198 


SB JCT : 


7124 


QUERY: 


7258 


SB JCT : 


7184 


QUERY : 


7318 


SB JCT : 


7244 


QUERY: 


7378 


SB JCT : 


7304 


QUERY : 


7438 


SB JCT : 


7364 


QUERY : 


7498 


SB JCT : 


7424 


QUERY : 


7558 


SB JCT : 


7484 


QUERY : 


7618 


SB JCT : 


7544 


QUERY : 


7678 


SB JCT : 


7604 


QUERY : 


. 7738 


SB JCT : 


7664 


QUERY : 


179Q 


SB JCT: 


7724 


QUERY : 


7858 


SB JCT: 


7784 


QUERY : 


7918 


SB JCT: 


7844 



TATGACCTCCGTGACCGGATAACCAGGCTAGGGGACGTGCAGTACAi^TCGATGACGAT 6883 




.CA^ATC 



GGCTATCTd^MlAGAGAGGGTCTGACATCTTCGAATACAATTCa^PKSCCTCCTAACA 7017 

mill I iTTTiii llllllll llllllll llllllll Illllllllll M 11 

GGCTATTTGTGCCAGAGAGGGTCAGACATCTTTGAATACAACTCCAAGGGCCTTCTGACG 694 3 

AGAGCCTACAACAAGGCCAGCGGGTGGAGTGTCCAGTACCGCTATGATGGCGTAGGACGG 7077 

IIIII lilllllllllllllll IIIII II IIIIIIIIIIIIII II II II II 

AGAGCATACAACAAGGCCAGCGGATGGAGCGTGCAGTACCGCTATGACGGAGTGGGCCGC 7003 

CGGGCTTCCTACAAGACCAACCTGGGCCACCACCTGCAGTACTTCTACTCTGACCTCCAC 7137 

llllllllllllllllllilMlllilllllllll IIIIIIIIIIIIII illllllll 

CGGGCTTCCTACAAGACCAACCTGGGCCACCACCTACAGTACTTCTACTCCGACCTCCAC 7063 

AACCCGACGCGCATCACCCATGTCTACAATCACTCCAACTCGGAGATTACCTCACTGTAC 7197 

IIIII II II Illllllllll IIIII Illllllllll IIIII IIIII II III 

AACCCCACACGTATCACCCATGTTTACAACCACTCCAACTCTGAGATCACCTCGCTCTAC 7123 

TACGACCTCCAGGGCCACCTCTTTGCCATGGAGAGCAGCAGTGGGGAGGAGTACTATGTT 7257 

II llllllllllillllll IIIIIMIIIIIIIill IIIII II II IIIIIMI 

TATGACCTCCAGGGCCACCTATTTGCCATGGAGAGCAGTAGTGGTGAAGAATACTATGTC 7183 

GCCTCTGATAACACAGGGACTCCTCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAAA 7317 

IIIII II IIIII IIIII 1 1 II nil I nil III IIIII iiiini llllll I 

GCCTCAGACAACACGGGGACCCCTCTGGCTGTGTACAGTATCAATGGCCTCATGATCAAG 7243 

CAGCTGCAGTACACGGCCTATGGGGAGATTTATTATGACTCCAACCCCGACTTCCAGATG 73 77 

II Illllllllll IIIIIIIIIIIIII II Illllllllll II niiiiiiiiii 

CAACTGCAGTACACAGCCTATGGGGAGATCTACTATGACTCCAATCCAGACTTCCAGATG 73 03 

GTCATTGGCTTCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAG 7437 

niiiiiiiiiiii II II IIIIIIIIIIIIII llllllll llllllll IIIII 

GTCATTGGCTTCCACGGAGGCCTCTATGACCCCCTCACCAAGCTCGTCCACTTTACTCAA 73 63 

CGTGATTATGATGTGCTGGCAGGACGATGGACCTCCCCAGACTATACCATGTGGAAAAAC 74 97 

Illllllllll IIIIIIIIIIIIII IIIII IIIII IIIII llllllllll III 

CGTGATTATGACGTGCTGGCAGGACGGTGGACGTCCCCCGACTACACCATGTGGAGGAAC 7423 

GTGGGCAAGGAGCCGGCCCCCTTTAACCTGTATATGTTCAAGAGCAACAATCCTCTCAGC 7557 

IIIIIIIIIIIIII niiiiii iiiiiiii llllllllll iiiiiiiiiiii III 

GTGGGCAAGGAGCCAGCCCCCTTCAACCTGTACATGTTCAAGAACAACAATCCTCTGAGC 74 83 

AGTGAGCTAGATTTGAAGAACTACGTGACAGATGTGAAAAGCTGGCTTGTGATGTTTGGA 7617 

I llllll 11 II lllllllllinillll IIIII lllllllllllllllllllll 
AATGAGCTGGACTTAAAGAACTACGTGACAGACGTGAAGAGCTGGCTTGTGATGTTTGGA 7543 

TTTCAGCTTAGCAACATCATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCT 7677 

llllllll lllinillllllllll lllllllllllllllllllllll llllllll 

TTTCAGCTCAGCAACATCATTCCTGGATTCCCGAGAGCCAAAATGTATTTTGTGCCTCCC 7603 

CCCTATGAATTGTCAGAGAGTCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAA 7737 

Illllllll lllllllllllllllllll IIIII iiiniiiiiiiiiiiiiiiiii 

CCCTATGAACTGTCAGAGAGTCAAGCAAGCGAGAACGGACAGCTCATTACAGGTGTCCAG 7663 

CAGACAACAGAGAGACATAACCAGGCCTTCATGGCTCTGGAAGGACAGGTCATTACTAAA 7797 

llllllll IIIII lllllllllllllll inillllilllllllllllll llllll 

CAGACAACTGAGAGGCATAACCAGGCCTTCCTGGCTCTGGAAGGACAGGTCATCACTAAA 7723 

AAGCTCCACGCCAGCATCCGAGAGAAAGCAGGTCACTGGTTTGCCACCACCACGCCCATC 7857 

llilllll lllilllllllllllllllllll Illllllllll nilllll llllll 

AAGCTCCATGCCAGCATCCGAGAGAAAGCAGGCCACTGGTTTGCTACCACCACACCCATC 7783 

ATTGGCAAAGGCATCATGTTTGCCATCAAAGAAGGGCGGGTGACCACGGGCGTGTCCAGC 7917 

II llllllllllllllllllllllllllllllllllllllllllll II IIIII III 
ATCGGCAAAGGCATCATGTTTGCCATCAAAGAAGGGCGGGTGACCACAGGAGTGTCTAGC 7843 

ATCGCCAGCGAAGATAGCCGCAAGGTGGCATCTGTGCTGAACAACGCCTACTACCTGGAC 7977 

iiiinii II II iininiiii inn iii iiiiiii iiiiiiiii i in 

ATCGCCAGTGAGGACAGCCGCAAGGTAGCATCCGTGTTGAACAATGCCTACTACTTAGAC 7903 
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QUERY: 


7978 


SB JCT : 


7904 


QUERY: 


8038 


SBJCT : 


7964 


QUERY: 


8098 


SBJCT : 


8024 


QUERY: 


8158 


SBJCT : 


8084 


QUERY : 


8218 


SBJCT : 


8144 


QUERY: 


8278 


SBJCT : 


8204 


QUERY: 


8338 


SBJCT : 


8264 


QUERY: 


8398 


SBJCT: 


8324 


QUERY: 


8458 


SBJCT : 


8384 


QUERY : 


8518 


SBJCT : 


8444 


QUERY: 


8578 


SBJCT : 


8503 


QUERY: 


8638 


SBJCT: 


8563 


SCORE 


= 15*: 



AAGATGCACTACAGCATCGAGGGCAAGGACACCCACTACTTTGTG. 
Illlllllg^lllllllllllllllllll IMIIIIMIII 

aagatgcacSMIIgcatcgagggcaaggacacacactactttgtg: 



TTGGCTCAGCC 8037 

Ml I II 

TCGGCGCCGCG 7963 



GATGGCGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGGTGCTAGAGAGCGGGGTG 8097 

Mill IIIIIIIIIII IIIII IIIIIIII II IMIIMIIII IIIII llllll 

GATGGTGACCTGGTCACGCTAGGAACCACCATTGGGCGCAAGGTGCTGGAGAGTGGGGTG 8023 
AACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAAGGTTCACGAAC 8157 

IIIIIIII IIIII IIIIIIIIIIIIMIII II IMIIIIIIIIIIIIMIII III 

AACGTGACGGTGTCACAGCCCACGCTGCTGGTGAATGGCAGGACTCGAAGGTTCACCAAC 8083 
ATTGAGTTCCAGTACTCCACGCTGCTGCTCAGCATCCGCTATGGCCTCACCCCCGACACC 8217 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI IIIIIIII IIIIIIIIMIMIIII 

ATTGAGTTCCAGTACTCCACGCTGCTGCTCAGTATCCGCTACGGCCTCACCCCCGACACG 814 3 
CTGGACGAAGAGAAGGCCCGCGTCCTGGACCAGGCGAGACAGAGGGCCCTGGGCACGGCC 8277 

lllllilllll IIIIIIIIMIIIIIIIIII III lllllll IIIIIIII II Ml 

CTGGACGAAGAAAAGGCCCGCGTCCTGGACCAAGCGGGACAGAGAGCCCTGGGTACTGCC 8203 
TGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGGAGCCGCCTGTGGACTGAG 8337 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII IMIIIIIIIIIII III 

TGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGCAGCCGCCTGTGGACGGAG 8263 
GGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTACGAGGGATATTACGTG 8397 

lllllllillllll II IIIIIIII II II II IIIII II Mill IIIIIIII 

GGCGAGAAGCAGCAACTCCTGAGCACGGGACGGGTACAAGGTTATGAGGGCTATTACGTA 8323 
CTTCCCGTGGAGCAATACCCAGAGCTTGCAGACAGTAGCAGCAACATCCAGTTTTTAAGA 8457 

IIIII IIIII II IIIII IIIII I'lllllllllllllllMIIIMIII llllll 

CTTCCGGTGGAACAGTACCCGGAGCTGGCAGACAGTAGCAGCAACATCCAGTTCTTAAGA 83 83 
CAGAATGAGATGGGAAAGAGGTAACAAAATAATCTGCTGCCATTCCTTGTCTGAATGGCT 8517 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIII lllllllll III MM IIIII 

CAGAATGAGATGGGAAAGAGGTAACAAAATAACCTGCTGCCACCTCTTCTCTGGGTGGCT 8443 
CAGCAGGAGTAACTGTTATCTCCTCTCCTAAGGAGATGAAGACCTAACAGGGGCACTGCG 8577 

lllllllll llllll I IIIIIMIIIIIIIIII IIIIIIIIIII IIIIIIMI I 

CAGCAGGAGCAACTGTGACCTCCTCTCCTAAGGAGACGAAGACCTAAC - GGGGCACTGAG 8502 
GCTGGGCTGCTTTAGGAGACCAAGTGGCAAGAAAGCTCACATTTTTTGAGTTCAAATGCT 863 7 

II llllllllllllll IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

GCCGGGCTGCTTTAGGATCCCAAGTGGCAAGAAAGCTCACATTTTTTGAGTTCAAATGCT 8562 
ACTGTCCAAGCGAGAAGTCCCTCATCCTGAAGTAGACTAAAGCCCGGC 8685 

llllll IIIII lllllllllllllllllllllllll IIMIIII 

ACTGTCTAAGCGCAAAGTCCCTCATCCTGAAGTAGACTAGAGCCCGGC 8610 



IDENTITIES = 1095/1196 (91%) 
STRAND = PLUS / PLUS 

QUERY : 270 ATCTGGAATAATGGATGTAAAGGACCGGCGACACCGCTCTTTGACCAGAGGACGCTGTGG 32 9 

lllllllllllllllllllllllllllllllll IIIIIIIIMIIII IIIII IIIII 

SBJCT: 103 ATCTGGAATAATGGATGTAAAGGACCGGCGACATCGCTCTTTGACCAGGGGACGGTGTGG 162 



QUERY: 330 

SBJCT: 163 

QUERY: 3 90 

SBJCT: 223 

QUERY: 4 50 

SBJCT: 283 

QUERY: 510 



CAAAGAGTGTCGCTACACAAGCTCCTCTCTGGACAGTGAGGACTGCCGGGTGCCCACACA 389 

iiiiiiiiiiiiiiiiii iiiiiiiiiiiiiiiiiiiiiiiiiiiii IIIIIIII II 

CAAAGAGTGTCGCTACACCAGCTCCTCTCTGGACAGTGAGGACTGCCGTGTGCCCACTCA 222 
GAAATCCTACAGCTCCAGTGAGACTCTGAAGGCCTATGACCATGACAGCAGGATGCACTA 44 9 

ill IIIIIIII IIIIIIIIIII lllllll lllllllllllllllll IIMIIII 

GAAGTCCTACAGTTCCAGTGAGACCTTGAAGGCTTATGACCATGACAGCAGAATGCACTA 282 
TGGAAACCGAGTCACAGACCTCATCCACCGGGAGTCAGATGAGTTTCCTAGACAAGGAAC 509 

llllillllllllllllllll I IIIIIIIIIII lllllllll llllllllll II 

TGGAAACCGAGTCACAGACCTGGTGCACCGGGAGTCCGATGAGTTTTCTAGACAAGGGAC 342 
CAACTTCACCCTTGCCGAACTGGGCATCTGTGAGCCCTCCCCACACCGAAGCGGCTACTG 569 

IIIIIIIIIII II III MM IIIII llllllllllllllllllll II IIIII 
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SB JCT : 


343 


QUERY: 


570 


SB JCT : 


403 


QUERY: 


630 


SB JCT : 


463 


QUERY: 


690 


SBJCT : 


523 


QUERY: 


750 


SBJCT : 


583 


QUERY: 


810 


SBJCT : 


643 


QUERY: 


870 


SBJCT : 


703 


QUERY : 


930 


SBJCT : 


763 


QUERY: 


990 


SBJCT : 


823 


QUERY: 


1050 


SBJCT : 


883 


QUERY: 


1110 


SBJCT : 


943 


QUERY : 


1170 


SBJCT : 


1003 


QUERY: 


1230 


SBJCT : 


1063 


QUERY : 


1290 


SBJCT : 


1123 


QUERY: 


1350 


SBJCT: 


1183 


QUERY : 


1410 


SBJCT: 


1243 



AAACTTCACCCTGGCAGAATTGGGAATCTGCGAGCCCTCCCCACACJ 



CTCCGACAT^HATCCTTCACCAGGGCTACTCCCTTAGCACAGGfl^PGACGCCGACTC 

IMIMIITT Mill IIIIIIIIIIIM'IIII illll llllllll II Mill 

TTCCGACATGGGTATCCTCCACCAGGGCTACTCCCTGAGCACTGGGTCTGATGCAGACTC 462 
CGACACCGAGGGAGGGATGTCTCCAGAACACGCCATCAGACTGTGGGGCAGAGGGATAAA 689 

llllllllllillMIIIIIIMMIMI IIIIIIIIIIIMIIII IMMIIIII 

GGACACCGAGGGAGGGATGTCTCCAGAACATGCCATCAGACTGTGGGGACGAGGGATAAA 522 

ATCCAGGCGCAGTTCCGGCCTGTCCAGTCGTGAAAACTCGGCCCTTACCCTGACTGACTC 74 9 

llllllllllll II III IMIIII II II IIIIIIIIIIIIM IMIMIIIII 
ATCCAGGCGCAGCTCTGGCTTGTCCAGCCGCGAGAACTCGGCCCTTACTCTGACTGACTC 582 

TGACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCATTCCACCTACATCCTCGCC 809 

MUM IIIM MIM MIM II II M 1 1 M M 1 1 II I M II II 1 1 1 1 1 M I I 

TGACAATGAAAATAAATCGGATGACGACAATGGTCGTCCCATTCCACCTACATCCTCGTC 642 
TAGTCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCA 869 

Ml MIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMMMIIIIIIIIIIIIIIIII 

TAGCCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCA 702 
GATGCCATTGCTAGACAGCAACACCTCCCATCAAATCATGGACACCAACCCTGATGAGGA 929 

IIIIMIIIIIIIIIMMMIIIIIIIIIIII IIMIMIMMIIIIIIIIIIIIII 

GATGCCATTGCTAGACAGCAACACCTCCCATCAGATCATGGACACCAACCCTGATGAGGA 762 



.CACmAAl 

.ggI^Pg 



GTGGTTACTG 402 



629 



ATTCTCCCCCAATTCATACCTGCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGCAG 

MM Mill IMIMIIIIIIIIIIIII II Illll Mill Ml Illll IMIIII II I II 

ATTCTCCCCCAATTCATACCTGCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGCAG 



989 



822 



TGGCCCTCCGAACCACCACAGCCAGTCGACTCTGAGGCCCCCTCTCCCACCCCCTCACAA 104 9 

lllllllll IMIIIIIIIIIMIII II llllllllllllll lllllllllll II 
TGGCCCTCCAAACCACCACAGCCAGTCAACACTGAGGCCCCCTCTGCCACCCCCTCATAA 8 82 

CCACACGCTGTCCCATCACCACTCGTCCGCCAACTCCCTCAACAGGAACTCACTGACCAA 1109 

MUM IIIMIII IIIMIM II IIIIIIIIMIIIMIIIIilllllMllill 

CCACACCCTGTCCCACCACCACTCCTCGGCCAACTCCCTCAACAGGAACTCACTGACCAA 942 

TCGGCGGAGTCAGATCCACGCCCCGGCCCCAGCGCCCAATGACCTGGCCACCACACCAGA 1169 

IIIIIIMIIII IIIIIMIIII II II illlllll llllllllllllll Illll 
TCGGCGGAGTCAAATCCACGCCCCAGCTCCTGCGCCCAACGACCTGGCCACCACCCCAGA 1002 

GTCCGTTCAGCTTCAGGACAGCTGGGTGCTAAACAGCAACGTGCCACTGGAGACCCGGCA 122 9 

III llllllll Illll lllllllllll Illll Illll lllllllllll Illll 
GTCTGTTCAGCTCCAGGATAGCTGGGTGCTGAACAGTAACGTCCCACTGGAGACTCGGCA 1062 

CTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCTCTTCCCCGGGATA 1289 

llllll Illll II II II II lllllllll IIIIIMIIIIIIIII llllllll 
CTTCCTTTTCAAAACGTCGTCTGGAAGCACACCCCTGTTCAGCAGCTCTTCTCCGGGATA 1122 

CCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTT 1349 

lllllllllllllll II Illll II II II llllllllllllll IMIIII II 
CCCTTTGACCTCAGGGACCGTTTATACACCACCACCCCGCCTGCTGCCACGGAATACATT 1182 

CTCCAGGAAGGCTTTCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGC 1409 

Mllllllllll IIIIIIIMIIIII lllllllllllllllll Mllllllllllll 

CTCCAGGAAGGCCTTCAAGCTGAAGAAACCCTCCAAATACTGCAGTTGGAAATGTGCTGC 1242 
CCTCTCCGCCATTGCCGCGGCCCTCCTCTTGGCTATTTTGCTGGCGTATTTCATAG 1465 

III II Mill IIIM IIIIIIIIIIIIM IIIIMIIIII llllllllll 

CCTGTCTGCCATCGCCGCCGCCCTCCTCTTGGCCATTTTGCTGGCATATTTCATAG 1298 



SCORE = 1455 BITS (734), EXPECT = 0.0 
IDENTITIES = 1000/1088 (91%), GAPS = 3/1088 (0%) 
STRAND = PLUS / PLUS 



QUERY: 1464 AGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCG 1523 

llllllllllll llllllllllllllllllllllllll lllllllllllllllllill 

SBJCT: 1420 AGTGCCCTGGTCATTGAAAAACAGCAGCATAGACAGTGGCGAAGCAGAAGTTGGTCGGCG 1479 
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QUERY : 


1524 


SB JCT : 


1480 


QUERY: 


1584 


SB JCT : 


1540 


QUERY: 


1644 


SBJCT: 


1600 


QUERY: 


1704 


SBJCT: 


1660 


QUERY : 


1764 


SBJCT : 


1720 


QUERY : 


1824 


SBJCT : 


1780 


QUERY: 


1884 


SBJCT : 


1840 


QUERY: 


1944 


SBJCT : 


1900 


QUERY: 


2004 


SBJCT : 


1960 


QUERY: 


2064 


SBJCT : 


2020 


QUERY : 


2124 


SBJCT : 


2080 


QUERY: 


2184 


SBJCT : 


2140 


QUERY: 


2244 


SBJCT : 


2200 


QUERY : 


2304 


SBJCT : 


2260 


QUERY : 


2364 


SBJCT : 


2320 


QUERY : 


2424 


SBJCT : 


2377 


QUERY: 


2484 


SBJCT : 


2437 


QUERY: 


2544 



GGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATIjj 

Ml lllllJIlllllMIIMIIIIIIIMIIIIIil 11 I 
ggtgacaq^MLgtcccaccaggggtgttttggaggtcccaga' 



T TCACA ' 

A' 

.i^PEa' 



.CATCAGTCAGCC 1583 

IIMIM 

.TCAGTCAGCC 153 9 



CCAGTTCTTAAAGTTCAACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAG 1643 

il IIIIIIIIIIIIIIIIIMIIII II Mill II Mill lllli II MM! 

TCAATTCTTAAAGTTCAACATCTCCCTGGGCAAGGATGCCCTCTTCGGTGTCTATATAAG 1599 

AAGAGGACTTCCACCATCTCATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGA 1703 

IIMIIII Mill lllllllllllllllllllllllllllll Mill II Mill 
GAGAGGACTACCACCGTCTCATGCCCAGTATGACTTCATGGAACGCCTGGATGGAAAGGA 1659 

GAAGTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAA 1763 

III Mill Mill Mill IIIIIIIIIIIIIMIIIII MM! MM MM! 

GAAATGGAGCGTGGTCGAGTCGCCCAGGGAACGCCGGAGCATCCAGACTCTGGTGCAGAA 1719 
TGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGA 1823 

II II lllllllllllllll lllllllllllllllllll lllllllllllllllll 

CGAGGCTGTGTTTGTGCAGTACTTGGATGTGGGCCTGTGGCACCTGGCCTTCTACAATGA 1779 

TGGAAAAGACAAAGAGATGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTG 1883 

II M Mill IIIIMII llllllll lllllllll lllllllllllllllllll 
CGGCAAGGACAAGGAGATGGTCTCCTTCAACACTGTTGTCTTAGATTCAGTGCAGGACTG 1839 

TCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGG 1943 

MUM MM! II Mill Mllllll Mill II lllllllllllllllllll 

TCCACGGAACTGTCACGGGAACGGTGAATGCGTGTCTGGACTGTGTCACTGTTTCCCAGG 1899 
ATTTCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGGACA 2003 

III Mill MIIIIIIIIIIMIIIIIIIIIIIIIII llllllll II llllllll 

ATTCCTAGGTGCAGACTGTGCTAAAGCTGCCTGCCCTGTACTGTGCAGCGGAAATGGACA 1959 
ATATTCTAAAGGGACGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCC 2063 

IIIMIIIIII IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII II Mill 

GTATTCTAAAGGAACGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGTGATGTGCC 2019 
CATGAATCAGTGCATCGATCCTTCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTG 2123 

Mill II II llllllllllllll llllllll IIIIIIIIIMIIIIIMIIIII 

TATGAACCAATGTATCGATCCTTCCTGTGGGGGCCATGGCTCCTGCATTGATGGGAACTG 2079 
TGTCTGCTCTGCTGGCTACAAAGGCGAGCACTGTGAGGAAGTTGATTGCTTGGATCCCAC 2183 

II II lllllllllllll MMMMIMMMIMMMIIIMIMIMM II 

CGTGTGTGCTGCTGGCTACAAGGGCGAGCACTGTGAGGAAGTTGATTGCTTGGATCCTAC 2139 



llllllllllll II IIIIIIIIIMIIMII II II IIMIIII lllllllllll 



TCTGAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTA 23 03 

III lllilllMIIMIIIIIIIIIIIIIIIIIIIIIIIII llllllllllllll M 

TCTCAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGTAGTGGGCATGGCACTTA 2259 

CCTGCCTGACACGGGCCTCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGT 2363 

III IIIMI I MIIIIIIIIMII Mill IIIIIIIIIMIIIIIIIIIIIIIII 

CCTCCCTGACTCCGGCCTCTGCAGCTGTGATCCGAACTGGATGGGTCCCGACTGCTCTGT 2319 

TGAAGTGTGCTCAGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTG 2423 

I IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

T GTGTGCTCAGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTG 2376 

TGAAGAGGGCTGGACAGGCGCAGCGTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGA 24 83 

IIIIIIIIIIIIIIIIMIIIIII lllllllllllillllllllllllMIIIIIIIII 

TGAAGAGGGCTGGACAGGCGCAGCTTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGA 2436 

GCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTG 2543 

II lllllllll Mill II IIIMI III llllllllllll IIIMI Mill Mill II III 

GCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTG 24 96 



CACCATTG 2551 
llllllll 
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SBJCT: 2497 CACCATTG 2504 
SCORE = 105 BITS ( EXPECT = 5E-19 

IDENTITIES = 81/89 , GAPS = 1/89 (1%) 

STRAND = PLUS / PLUS 



QUERY: 


8711 


SBJCT: 


8655 


QUERY: 


8771 


SBJCT: 


8715 


SCORE 


= 48. 



Illlll lllllll I Mlllll llllil IIIIIIIIIIIIIIIIIIIIIIMIIII 



III IIMII I IIIIMIMIIIIIII 



IDENTITIES = 30/32 (93%) 
STRAND = PLUS / PLUS 

QUERY: 475 CACCGGGAGTCAGATGAGTTTCCTAGACAAGG 506 

IIIIIIIMII llillllll IIIIIIMII 
SBJCT: 7 CACCGGGAGTCCGATGAGTTTTCTAGACAAGG 38 

In this search it was also found that the FCTR3bcd and e nucleic acids had homology to 
three fragments of Rattus norvegicus neurestin alpha. It has 5498 of 6132 bases (89%) identical 
to bases 2527-8658, 1081 of 1 196 bases (90%) identical to bases 123-1318, 996 of 1088 bases 
(91%) identical to bases 1440-2527 of Rattus norvegicus neurestin alpha (GenBank 
Acc:NM_020088.1) (Table 3N). 

Table 3N. BLASTN of FCTR3b, c, d, and e against Rattus norvegicus Neurestin alpha 

mRNA (SEQ ID NO:66) 

> GI|9910319|REF|NM 020088. l| RATTUS NORVEGICUS NEURESTIN ALPHA (L0C56762) , MRNA 
LENGTH = 8689 

SCORE = 7129 BITS (3596), EXPECT =0.0 
IDENTITIES = 5498/6132 (89%) 
STRAND = PLUS / PLUS 

GATGGCTGCCCTGACTTGTGCAACGGTAACGGGAGATGCACACTGGGTCAGAACAGCTGG 2637 

llllllllllllll IIIIIIIIIMIIIIIIIIIIIIIMIIIIIIillllllllllll 
GATGGCTGCCCTGATTTGTGCAACGGTAACGGGAGATGCACACTGGGTCAGAACAGCTGG 2586 

CAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACTTCC 2697 

1 1 Illlll III I.I MM I II Ml III II 1 1 II II I II II II II I III 1 1 II II II I III 

CAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACCTCC 2646 

TGTGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGATTGTTTGGACCCTGACTGC 2757 

II MIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIM II llllllllllllll 

TGCGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGACTGCCTGGACCCTGACTGC 2706 

TGCCTGCAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGTCCCGGGACCCACTGGAC 2 817 

Mill IIIIIIIIIIIIIIIIIIIIIIIIIIIII IIIIIIII IIIIIIII Mill 
TGCCTCCAGTCAGCCTGTCAGAACAGCCTGCTCTGTCGGGGGTCTCGGGACCCCTTGGAC 2766 

ATCATTCAGCAGGGCCAGACGGATTGGCCCGCAGTGAAGTCCTTCTATGACCGTATCAAG 2877 

Illllllllll lllllill M Mill II lllllllllllllllll lllllllli 
ATCATTCAGCAAGGCCAGACAGACTGGCCTGCGGTGAAGTCCTTCTATGATCGTATCAAG 2826 

CTCTTGGCAGGCAAGGATAGCACCCACATCATTCCTGGAGAGAACCCTTTCAACAGCAGC 2 937 

IIIIIIIIIIIIIIMI IIIIIIIIIMIIIIIIIIIIII Mill Mill Illlll 
CTCTTGGCAGGCAAGGACAGCACCCACATCATTCCTGGAGACAACCCCTTCAATAGCAGC 2886 

TTGGTTTCTCTCATCCGAGGCCAAGTAGTAACTACAGATGGAACTCCCCTGGTCGGTGTG 2997 

MM Mill llllllllllllllllllll II IMII M IIIIIIII Illlll 

CTGGTGTCTCTGATCCGAGGCCAAGTAGTAACCACGGATGGGACCCCCCTGGTGGGTGTG 2946 



QUERY: 


2578 


SBJCT : 


2527 


QUERY: 


2638 


SBJCT: 


2587 


QUERY: 


2698 


SBJCT: 


2647 


QUERY: 


2758 


SBJCT: 


2707 


QUERY : 


2818 


SBJCT : 


2767 


QUERY: 


2878 


SBJCT: 


2827 


QUERY: 


2938 


SBJCT : 


2887 
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QUERY: 


2998 


SB JCT : 


2947 


QUERY: 


3058 


SB JCT : 


3007 


QUERY: 


3118 


SB JCT : 


3067 


QUERY: 


3178 


SB JCT : 


3127 


QUERY: 


3238 


SB JCT : 


3187 


QUERY: 


3298 


SB JCT : 


3247 


QUERY : 


3358 


SB JCT : 


3307 


QUERY : 


3418 


SB JCT : 


3367 


QUERY : 


3478 


SB JCT : 


3427 


QUERY : 


3538 


SB JCT : 


3487 


QUERY : 


3598 


SB JCT : 


3547 


QUERY : 


3658 


SB JCT : 


3607 


QUERY : 


3718 


SB JCT : 


3667 


QUERY: 


3778 


SB JCT: 


3727 


QUERY: 


3838 


SB JCT : 


3787 


QUERY: 


3898 


SBJCT : 


3847 


QUERY : 


3958 


SBJCT : 


3907 


QUERY: 


4018 



AACGTGTC 

M MM 

AATGTGTCT 



ttWg' 



TCAAGTACCCAAAATACGGCTACACCATCACC^^kAGGATGGCACG 3057 

M M M M M M M M M M M M M M M Wl Ml M M M 

TCAAGTACCCAAAATATGGCTACACCATCACTCGCCAGGACGGCACG 3006 



TTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAGCGAGCCCCGTTC 3117 

M MMMM M MMI M M MMMMM MMMMMMMMI Ml 

TTTGACCTGATTGCCAATGGGGGCTCTGCCTTGACTCTTCACTTTGAGCGAGCCCCTTTC 3 066 

ATGAGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCCATGGACACCCTG 3177 

lllllll lllllllll II MM lllllllllllllll II IIMIIIIIIIIIII 
ATGAGCCGGGAGCGCACAGTATGGCCGCCGTGGAACAGCTTCTATGCCATGGACACCCTG 3126 

GTGATGAAGACCGAGGAGAACTCCATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCT 3237 

II llllllll MMMMIMIMMMMIIIMMMMMMMMIMMMM 

GTAATGAAGACGGAGGAGAACTCCATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCT 3186 
GATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTGCTGCCCCTGGGCAGAAT 3297 

Mill llllllllllllll M IIIIMIIilllM M III llllll I III 

GATCCGATCATCATCTCCTCTCCTCTGTCCACCTTCTTCAGCGCTTCCCCTGCGGCGAAC 3246 
CCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCTGGTTCCAATGTG 3357 

Mill llllllllllllllllllllllllll II IIIIMIIIIIIII MM III 

CCCATTGTGCCTGAGACCCAGGTTCTTCATGAGGAGATCGAGCTCCCTGGCACCAACGTG 3306 
AAACTTCGCTATCTGAGCTCTAGAACTGCAGGGTACAAGTCACTGCTGAAGATCACCATG 3417 

II II II II II MMI Mill IIIIIIIIMMIMIIIIIIIIIIIIIIIIII 

AAGCTCCGTTACCTCAGCTCCAGAACAGCAGGGTACAAGTCACTGCTGAAGATCACCATG 3 366 

ACCCAGTCCACAGTGCCCCTGAACCTCATTAGGGTTCACCTGATGGTGGCTGTCGAGGGG 3477 

IIIIMIIIII llllll llllllllll llllllll lllllll II II llllll 
ACCCAGTCCACGGTGCCCTTGAACCTCATCCGGGTTCACTTGATGGTTGCCGTGGAGGGG 3426 

CATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCTCCACCTTCATCTGGGAC 353 7 

MMMMMMMMI II I M II I II M 1 1 II I M II II II III IIIIIIIIIIM 

CATCTCTTCCAGAAGTCGTTCCAGGCTTCTCCCAACCTGGCCTACACATTCATCTGGGAC 3486 
AAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGTGTCTGTCGGG 3597 

llllllll II llllllllllllll Mill II II Illllllllllllllll II 

AAGACAGACGCTTATGGCCAAAGGGTTTATGGCCTATCGGATGCTGTTGTGTCTGTTGGA 3 546 
TTTGAATATGAGACCTGTCCCAGTCTAATTCTCTGGGAGAAAAGGACAGCCCTCCTTCAG 3 657 

IIIIIIIIIIIMIIII llllllll II II Mill llllllllllllll Mill 

TTTGAATATGAGACCTGCCCCAGTCTCATCCTGTGGGAAAAAAGGACAGCCCTACTTCAA 3 606 
GGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCCTAGACAAACACCACATCCTC 3 717 

MMMMMMMMI llllllll llllllllllllll II II lllllll MM 

GGATTCGAGCTGGACCCTTCCAACCTTGGTGGCTGGTCCCTGGATAAGCACCACACCCTC 3666 
AATGTTAAAAGTGGAATCCTACACAAAGGCACTGGGGAAAACCAGTTCCTGACCCAGCAG 3777 

Mill Mill MMI MM lllllllll Mill II II II I II 1 1 1 II II 1 1 II I 

AATGTGAAAAGCGGAATACTACTCAAAGGCACAGGGGAGAACCAGTTCCTGACCCAGCAG 3 726 

CCTGCCATCATCACCAGCATCATGGGCAATGGTCGCCGCCGGAGCATTTCCTGTCCCAGC 3 837 

II lllllllllllllllllllllll II lllllllll I Mill llllllllllll 
CCCGCCATCATCACCAGCATCATGGGTAACGGTCGCCGCAGAAGCATCTCCTGTCCCAGC 3786 

TGCAACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATC 3897 

Mill IIIIIIIIIIIIIIIIIIM III lllllll Mill llllllll II III 
TGCAATGGCCTTGCTGAAGGCAACAAACTGTTGGCCCCCGTGGCCCTGGCTGTGGGGATC 384 6 

GATGGGAGCCTCTATGTGGGTGACTTCAATTACATCCGACGCATCTTTCCCTCTCGAAAT 3957 

lllllllllllll III IIMIIIIMMII Mill lillllll II IIMIIII 

GATGGGAGCCTCTTTGTCGGTGACTTCAATTATATCCGGCGCATCTTCCCTTCTCGAAAC 3906 
GTGACCAGCATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAACCCAGCACAC 4017 

llllllll lllllllllllllllllllllillllllllllllllllll Mill MM 

GTGACCAGTATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAGCCCAGGACAC 3 966 
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SB JCT : 


3967 


QUERY : 


4078 


SB JCT : 


4027 


QUERY: 


4138 


SB JCT : 


4087 


QUERY: 


4198 


SB JCT : 


4147 


QUERY: 


4258 


SB JCT : 


4207 


QUERY: 


4318 


SB JCT : 


4267 


QUERY: 


4378 


SB JCT : 


4327 


QUERY: 


4438 


SB JCT : 


4387 


QUERY: 


4498 


SBJCT : 


4447 


QUERY : 


4558 


SBJCT : 


4507 


QUERY : 


4618 


SBJCT : 


4567 


QUERY: 


4678 


SBJCT : 


4627 


QUERY: 


4738 


SBJCT : 


4687 


QUERY: 


4798 


SBJCT : 


4747 


QUERY: 


4858 


SBJCT : 


4807 


QUERY : 


4918 


SBJCT: 


4867 


QUERY : 


4978 


SBJCT : 


4927 


QUERY: 


5038 


SBJCT : 


4987 



Mlillll 

AAGTACTAi 



Mill llllllll III I lllllllllll II 
?CTGTGGACCCTGTGACTGGCTCGCTCTATGT< 



G 



lllllllll 
CACCAACAGT 4026 



AGGAGAATCfTOCGCGTCAAGTCTCTGAGTGGAACCAAAGACCTGGCTGGGAATTCGGAA 4137 

I I llllllll lllllllllll II III llllllllllllllllllllllllll 
CGCCGGATCTACCGAGTCAAGTCTCTAAGCGGAGCCAAAGACCTGGCTGGGAATTCGGAA 4086 

GTTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATGAAGCCCGCTGCGGGGATGGA 4197 

llllllll Mill II II II IIMIIIIIIIIIIIIIIIIIIIIII MMMM 

GTTGTGGCCGGGACTGGCGAACAATGTCTACCCTTTGATGAAGCCCGCTGTGGGGATGGC 4146 
GGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTATTGCAGTAGACAAGAATGGG 4257 

MMMM I Mill MMMMMMM 1 1 II I II I II M I II M II I II I III 

GGGAAGGCTGTGGATGCCACCCTGATGAGCCCTAGAGGTATTGCAGTAGACAAGAACGGG 4206 
CTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCC 4317 

II Mill Mill IIIIIIIIMIIIIillllllll Mill lllllllllllllll 

CTTATGTATTTTGTTGATGCCACCATGATCCGGAAGGTCGACCAAAATGGAATCATCTCC 4266 
ACCCTGCTGGGCTCCAATGACCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATG 4377 

IIIIIIIIIMIIIIIIIIIIIIIII II Mill II IIIIIIIIIM II IIIIM 

ACCCTGCTGGGCTCCAATGACCTCACAGCTGTCCGACCACTGAGCTGTGACTCTAGCATG 4326 
GATGTAGCCCAGGTTCGTCTGGAGTGGCCAACAGACCTTGCTGTCAATCCCATGGATAAC 4437 

II II llllllll Mill II Mill IIMIIIIIII Mill llllllll II 

GACGTGGCCCAGGTCCGTCTAGAATGGCCGACAGACCTTGCGGTCAACCCCATGGACAAT 43 86 
TCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATCACCGAGAACCACCAAGTCAGC 44 97 

III MM II II llllllll llllllll II lllllllllll Mill llllll 

TCCCTGTACGTCCTGGAGAACAACGTCATCCTGCGGATCACCGAGAATCACCAGGTCAGC 4446 
ATCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTGACTACTCACTCAGCAAA 4557 

Mill llllllll MMMMMMM Mill Mill llllllll llllllll 

ATCATCGCGGGACGGCCCATGCACTGCCAGGTTCCCGGCATCGACTACTCGCTCAGCAAG 4506 
CTAGCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTC 4617 

II Mill llllllll MMMMMMM Mill MMMMMMM Mill 

CTCGCCATCCACTCTGCTCTGGAGTCAGCCAGCGCCATCGCCATTTCTCACACCGGGGTG 4566 

CTCTACATCACTGAGACAGATGAGAAGAAGATTAACCGTCTACGCCAGGTAACAACCAAC 4677 

lllllllllll Mill II lllllllllll Mill lllllllllll II llllll 
CTCTACATCACCGAGACGGACGAGAAGAAGATCAACCGCCTACGCCAGGTCACCACCAAC 4626 

GGGGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCGACTGCAAAAACGATGTCAAT 473 7 

II lllllllllll Mill lllllllllll Mill lllllllllll II Mill 

GGAGAGATCTGCCTCTTAGCCGGGGCAGCCTCAGACTGTGACTGCAAAAATGACGTCAAC 4686 
TGCAACTGCTATTCAGGAGATGATGCCTACGCGACTGATGCCATCTTGAATTCCCCATCA 4797 

MM lllllllll llllllll II Mill II llllllllllllll Mill II 

TGCATCTGCTATTCGGGAGATGACGCATACGCCACGGATGCCATCTTGAACTCCCCGTCC 4746 
TCCTTAGCTGTAGCTCCAGATGGTACCATTTACATTGCAGACCTTGGAAATATTCGGATC 4 857 

lllllllllll Mill Mill Mill Mill llllllll II Mill llllll 

TCCTTAGCTGTGGCTCCGGATGGCACCATCTACATCGCAGACCTCGGGAATATCCGGATC 4806 

AGGGCGGTCAGCAAGAACAAGCCTGTTCTTAATGCCTTCAACCAGTATGAGGCTGCATCC 4917 

llllllllllllll Mill lllllllllll II llllllllllllllllllll II 
AGGGCGGTCAGCAAAAACAAACCTGTTCTTAACGCGTTCAACCAGTATGAGGCTGCGTCT 4866 

CCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCATCCACCAATACACTGTGAGC 4977 

II Mill Mill I II II MMMM Mill Mill II Mill llllll 

CCGGGAGAACAGGAACTGTACGTGTTCAACGCCGATGGTATCCATCAGTACACCGTGAGC 4 926 

CTGGTGACAGGGGAGTACTTGTACAATTTCACATATAGTACTGACAATGATGTCACTGAA 5037 

llllllll lllllllllll IIIIIMIIII II II llllllllllllllll II 
CTGGTGACCGGGGAGTACTTATACAATTTCACCTACAGCGCTGACAATGATGTCACCGAG 4 986 

TTGATTGACAATAATGGGAATTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGT 5097 

lllllllllll II lllllllllll llllllll lllllllllllllllllllllll 
TTGATTGACAACAACGGGAATTCCCTAAAGATCCGCCGGGACAGCAGTGGCATGCCCCGA 5046 

60 15966-697 



QUERY: 
SB JCT : 
QUERY: 
SBJCT : 



5098 CACCTGCTi 

lllllill 
5047 CACCTGCTO?' 



W 

CATCC" 



CTGACAACCAGATCATCACCCTCACCGTGGGC^^^TGGAGGCCTC 5157 

Mi II I II III lllllill II lillilWI illllllll 
CCTGATAATCAGATCATCACCCTTACGGTGGGCACCAACGGAGGCCTC 5106 



5158 AAAGTCGTGTCCACACAGAACCTGGAGCTTGGTCTCATGACCTATGATGGCAACACTGGG 5217 

MM llllll II MMMIMMIM II llllllll IMIIIII IIIIIMI 

5107 AAAGCCGTGTCAACGCAGAACCTGGAGCTGGGCCTCATGACTTATGATGGGAACACTGGA 5166 



QUERY: 5218 CTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTTTCTATGACTATGACCACGAA 5277 

Mill llllllllllllllllllll IIIMIM lllli lllllllliilllllll 

SBJCT: 5167 CTCCTAGCCACCAAGAGCGATGAAACCGGATGGACAACTTTTTATGACTATGACCACGAG 5226 
QUERY : 5278 GGCCGCCTGACCAACGTGACGCGCCCCACGGGGGTGGTAACCAGTCTGCACCGGGAAATG 533 7 

Mill IMIIIII lllli IMIIIillllllllll Mill Mlllllllllllll 

SBJCT: 5227 GG CCGTCTGACCAATGTGACTCGCCCCACGGGGGTGGTGACCAGCCTGCACCGGGAAATG 5286 
QUERY: 5338 GAGAAATCTATTACCATTGACATTGAGAACTCCAACCGTGATGATGACGTCACTGTCATC 5397 

llllllll II III IlillllllMllllllillllllll I II IMIIIII II 

SBJCT: 5287 GAGAAATCCATCACCGTTGACATTGAGAACTCCAACCGTGATAACGATGTCACTGTGATT 5346 
QUERY: 53 98 ACCAACCTCTCTTCAGTAGAGGCCTCCTACACAGTGGTACAAGATCAAGTTCGGAACAGC 5457 

lllllllliilllllll 1 1 II I nil 1 11 II III Mil I II II I III I II II II ill 

SBJCT: 5347 ACCAACCTCTCTTCAGTGGAGGCCTCCTACACCGTGGTACAAGATCAAGTGCGGAACAGC 5406 



QUERY: 5458 TACCAGCTCTGTAATAATGGTACCCTGAGGGTGATGTATGCTAATGGGATGGGTATCAGC 5517 

lllllilllli I II 11 lliill I 11 lliil li 11 11 lllli lllli 

SBJCT: 5407 TACCAGCTCTGCAGCAACGGGACCCTGCGCGTCATGTACGCCAACGGCATGGGCGTCAGC 5466 
QUERY: 5518 TTCCACAGCGAGCCCCATGTCCTAGCGGGCACCATCACCCCCACCATTGGACGCTGCAAC 5577 

lllllllllllllllll IIMI II llllll lllllllllllll II Mill III 

SBJCT: 54 67 TTCCACAGCGAGCCCCACGTCCTCGCAGGCACCCTCACCCCCACCATCGGGCGCTGTAAC 5526 
QUERY: 5578 ATCTCCCTGCCTATGGAGAATGGCTTAAACTCCATTGAGTGGCGCCTAAGAAAGGAACAG 5637 

lllli Mi III .11 Mil 11 Mi I 11 llllll illiiilllll 11 Mil Mill 

SBJCT: 552 7 ATCTCCCTGCCCATGGAGAACGGCCTGAACTCCATCGAGTGGCGCCTGAGGAAGGAACAG 5586 

QUERY: 563 8 ATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCCGGGTCCATGGAAGAAATCTCTTG 5697 

iliillllliilllillliiiliill liiliiil illiiill lllii 11 III II 

SBJCT: 5587 ATTAAAGGCAAAGTCACCATCTTTGGGAGGAAGCTTCGGGTCCACGGAAGGAACCTCCTG 5646 

QUERY: 5698 TCCATTGACTATGATCGAAATATTCGGACTGAAAAGATCTATGATGACCACCGGAAGTTC 5757 

llllllll Mill llllllll II Mill lllllllllll lllllllllllllll 

SBJCT: 564 7 TCCATTGATTATGACCGAAATATCCGCACTGAGAAGATCTATGACGACCACCGGAAGTTC 5706 
QUERY: 5758 ACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCCTCTGGCTGCCCAGCAGCGGG 5817 

lllllllllllillllllllMIIIIMIIIIIilllllll Mill llllllll II 

SBJCT: 5707 ACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCCTGTGGCTCCCCAGCAGTGGA 5766 
QUERY: 5818 CTGGCAGCTGTCAACGTGTCATACTTCTTCAATGGGCGCCTGGCTGGGCTTCAGCGTGGG 5877 

Mill II Mill II II lllllllllll lllllllllll II II Mill III 

SBJCT: 5767 CTGGCGGCCGTCAATGTCTCCTACTTCTTCAACGGGCGCCTGGCCGGCCTCCAGCGCGGG 5826 
QUERY: 5878 GCCATGAGCGAGAGGACAGACATCGACAAGCAAGGCCGCATCGTGTCCCGCATGTTCGCT 5937 

IIIIMIIMIMIIIIIIIMI IIMIIIIIilMI II llllllll llllllll 

SBJCT: 582 7 GCCATGAGCGAGAGGACAGACATTGACAAGCAAGGCCGGATTGTGTCCCGAATGTTCGCC 5886 
QUERY : 593 8 GACGGGAAAGTGTGGAGCTACTCCTACCTTGACAAGTCCATGGTCCTCCTGCTTCAGAGC 5997 

lllllllllll llllllll iiiiiiMiiiiiiilMillliiiiiiiiii llllll 

SBJCT: 5887 GACGGGAAAGTCTGGAGCTATTCCTACCTTGACAAGTCCATGGTCCTCCTGCTGCAGAGC 5946 
QUERY: 5998 CAACGTCAGTATATATTTGAGTATGACTCCTCTGACCGCCTCCTTGCCGTCACCATGCCC 6057 

II illlMII liiliiil MMMMMMMMMMM II illlilllllll 

SBJCT: 594 7 CAGCGTCAGTACATATTTGAATATGACTCCTCTGACCGCCTCCACGCAGTCACCATGCCC 6006 
QUERY: 6058 AGCGTGGCCCGGCACAGCATGTCCACACACACCTCCATCGGCTACATCCGTAATATTTAC 6117 

II M lllllllilMlilllllll lllllililll IMMMIMI M IIIIM 

SBJCT: 6007 AGTGTCGCCCGGCACAGCATGTCCACGCACACCTCCATTGGCTACATCCGGAACATTTAC 6066 
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QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT: 
QUERY : 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 



6118 AACCCGCCT^^GCAATGCTTCGGTCATCTTTGACTACAGTGAT^^PCCGCATCCTG 6177 

Mill II Vrilll II IlllllllllllllllllllllllWlllllllllil 

6067 AACCCACCGGAAAGCAACGCCTCGGTCATCTTTGACTACAGTGATGACGGCCGCATCCTG 6126 



6178 AAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTACAAGTATGGGAAACTCTCCAAG 6237 

iiiii II II iiiiiiiiii llllllllllllll Mill II mil mill 

6127 AAGACGTCTTTCCTGGGCACCGGGCGCCAGGTGTTCTATAAGTACGGAAAACTGTCCAAG 6186 

6238 TTATCAGAGATTGTCTACGACAGTACCGCCGTCACCTTCGGGTATGACGAGACCACTGGT 6297 

IIIII IIIII lllllllllll II millllllllll lllllllllllllllil 

6187 TTATCGGAGATCGTCTACGACAGCACTGCCGTCACCTTCGGCTATGACGAGACCACTGGC 624 6 

6298 GTCTTGAAGATGGTCAACCTCCAAAGTGGGGGCTTCTCCTGCACCATCAGGTACCGGAAG 6357 

III IIIIIIIIII II llllllll llllllllllllll llllllllllllll III 

6247 GTCCTGAAGATGGTGAATCTCCAAAGCGGGGGCTTCTCCTGTACCATCAGGTACCGAAAG 6306 

6358 ATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCCGAGGAAGGCATGGTCAATGCC 6417 

I II IIIII lllllllimill lllllllllll lllimillll MM III 

6307 GTCGGGCCCCTCGTGGACAAGCAGATTTACAGGTTCTCTGAGGAAGGCATGATCAACGCC 6366 

6418 AGGTTTGACTACACCTATCATGACAACAGCTTCCGCATCGCAAGCATCAAGCCCGTCATA 6477 

IIIII II llllllll II IIIIIIIIIMIIIIIIIII lllllllllllllllll 

6367 AGGTTCGATTACACCTACCACGACAACAGCTTCCGCATCGCCAGCATCAAGCCCGTCATC 6426 

6478 AGTGAGACTCCCCTCCCCGTTGACCTCTACCGCTATGATGAGATTTCTGGCAAGGTGGAA 6 53 7 

llllllllllllll llllllllllllllllllll IIIIIIIIIIIIMIIIIIIIIII 

6427 AGTGAGACTCCCCTTCCCGTTGACCTCTACCGCTACGATGAGATTTCTGGCAAGGTGGAA 64 86 

653 8 CACTTTGGTAAGTTTGGAGTCATCTATTATGACATCAACCAGATCATCACCACTGCCGTG 6597 

mil II IIIII II iiiiMii II iiiiiiiiiiiiMiiiiiiiiiiiim 

64 87 CACTTCGGCAAGTTCGGGGTCATCTACTACGACATCAACCAGATCATCACCACTGCCGTC 654 6 

6598 ATGACCCTCAGCAAACACTTCGACACCCATGGGCGGATCAAGGAGGTCCAGTATGAGATG 6657 

mil llllllll IIIII llllllllllllll llllllll II llllllllllll 

6547 ATGACACTCAGCAAGCACTTTGACACCCATGGGCGCATCAAGGAAGTGCAGTATGAGATG 6606 

6658 TTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACAGCATGGGCAGGGTGATCAAG 6717 

iiiimiiiiiimiiiiiiiiiiiiiiiiiiiiiiiii iiiiiiMiii mm 

6607 TTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACAGTATGGGCAGGGTCATCAAG 6666 

6718 AGGGAGCTAAAACTGGGGCCCTATGCCAATACCACGAAGTACACCTATGACTACGATGGG 6777 

IIIII II llllllllllllllllllll IIIII llllllllllllllllllll III 

6667 AGGGAACTGAAACTGGGGCCCTATGCCAACACCACAAAGTACACCTATGACTACGACGGG 6726 

6778 GACGGGCAGCTCCAGAGCGTGGCCGTCAATGACCGCCCGACCTGGCGCTACAGCTATGAC 683 7 

IIIII lllllllllll lllllllllllllllll II llllim II IIMIIIII 

6727 GACGGCCAGCTCCAGAGTGTGGCCGTCAATGACCGGCCTACCTGGCGTTATAGCTATGAC 6786 

6838 CTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTGTGCGCCTCATGCCCTTGCGC 6897 

II llllllll II Ml I II llllllll lllllll lllllllllll II Ml 

6787 CTCAATGGGAACCTGCACCTGCTAAACCCAGGAAACAGTGCTCGCCTCATGCCGTTACGC 6846 

6898 TATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGAT 6957 

lllllllllll II MMMIIIM II IIIII llllllllllllll II II III 

684 7 TATGACCTCCGTGACCGGATAACCAGGCTAGGGGACGTGCAGTACAAAATCGATGATGAT 6 906 

6958 GGCTATCTGTGCCAGAGAGGGTCTGACATCTTCGAATACAATTCCAAGGGCCTCCTAACA 7017 

mm I MMMIIIM lllllllllll llllllil MIIIMIIII IIIII 

6907 GGCTATTTATGCCAGAGAGGATCTGACATCTTTGAATACAACTCCAAGGGCCTTCTAACG 6966 

7018 AGAGCCTACAACAAGGCCAGCGGGTGGAGTGTCCAGTACCGCTATGATGGCGTAGGACGG 7077 

IIIII IIMimillllllllMIIIIIII IIIMIIIIIIIIIIIIIII I II 

6967 AGAGCGTACAACAAGGCCAGCGGGTGGAGTGTGCAGTACCGCTATGATGGCGTGAGCCGC 7026 



QUERY: 7078 CGGGCTTCCTACAAGACCAACCTGGGCCACCACCTGCAGTACTTCTACTCTGACCTCCAC 7137 

IIIMIIIIIIIIIIIIIillllllllllllllil lllllllllll M lllllllll 
SBJCT: 7027 CGGGCTTCCTACAAGACCAACCTGGGCCACCACCTACAGTACTTCTATTCCGACCTCCAC 7086 



62 
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QUERY: 


7138 


SB JCT : 


7087 


QUERY: 


7198 


SB JCT : 


7147 


QUERY: 


7258 


SB JCT : 


7207 


QUERY: 


7318 


SB JCT : 


7267 


QUERY : 


7378 


SB JCT : 


7327 


QUERY: 


7438 


SB JCT : 


7387 


QUERY: 


7498 


SB JCT: 


7447 


QUERY: 


7558 


SB JCT : 


7507 


QUERY: 


7618 


SB JCT: 


7567 


QUERY : 


7678 


SB JCT: 


7627 


QUERY: 


7738 


SB JCT : 


7687 


QUERY: 


7798 


SB JCT : 


7747 


QUERY: 


7858 


SB JCT : 


7807 


QUERY: 


7918 


SB JCT : 


7867 


QUERY : 


7978 


SB JCT : 


7927 


QUERY: 


8038 


SB JCT : 


7987 


QUERY: 


8098 


SB JCT: 


8047 


QUERY: 


8158 



AACCCGACGCGCATCACCCATGTCTACAATCACTCCAACTCGGAa 

MM MJ^MMMMM MMI MMMMMI M 

CACCCCAd^B^TCACCCATGTTTACAACCACTCCAACTCTGA< 



.G^gAi 



CCTCACTGTAC 7197 

MMMI Ml 

rCTCACTCTAC 7146 



TACGACCTCCAGGGCCACCTCTTTGCCATGGAGAGCAGCAGTGGGGAGGAGTACTATGTT 7257 

M MIMMMMMMMMMMMMMMMM MMMM MMMMMM 

TATGACCTCCAGGGCCACCTCTTTGCCATGGAGAGCAGTAGTGGGGAAGAGTACTATGTT 7206 
GCCTCTGATAACACAGGGACTCCTCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAAA 7317 

MMI IIIIIIM IIIIIIIIMIIIIIII M II Mill IIMIIIIIIIIII 

GCCTCAGATAACACCGGGACTCCTCTGGCTGTTTTTAGTATCAATGGCCTCATGATCAAG 7266 
CAGCTGCAGTACACGGCCTATGGGGAGATTTATTATGACTCCAACCCCGACTTCCAGATG 73 77 

II II II Mill lllllllllllllllll MMMMMI II Mill llllll 

CAACTCCAATACACAGCCTATGGGGAGATTTACTATGACTCCAATCCAGACTTTCAGATG 7326 
GTCATTGGCTTCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAG 7437 

Mill llllllli II II Mill Illlllll Illlllll II Mill II III 

GTCATCGGCTTCCACGGAGGCCTCTACGACCCCCTCACCAAGCTCGTTCACTTTACGCAG 7386 
CGTGATTATGATGTGCTGGCAGGACGATGGACCTCCCCAGACTATACCATGTGGAAAAAC 74 97 

IIIIIMIIII llllllllllllll Mill Mill Mill IIIIIIIIM II 

CGTGATTATGACGTGCTGGCAGGACGGTGGACGTCCCCCGACTACACCATGTGGAGGAAT 7446 
GTGGGCAAGGAGCCGGCCCCCTTTAACCTGTATATGTTCAAGAGCAACAATCCTCTCAGC 7557 

IIIIIIMIIIIII Illlllll Illlllll IIIIIIIMI lllllllll Mill 

GTGGGCAAGGAGCCAGCCCCCTTCAACCTGTACATGTTCAAGAACAACAATCCACTCAGT 7506 
AGTGAGCTAGATTTGAAGAACTACGTGACAGATGTGAAAAGCTGGCTTGTGATGTTTGGA 7617 

I llllll Mill IIIIIIIIMMMIII MM! Illlllll llllllllllll 

AATGAGCTGGATTTAAAGAACTACGTGACAGACGTGAAGAGCTGGCTCGTGATGTTTGGA 7566 

TTTCAGCTTAGCAACATCATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCT 7677 

Illlllll lllllllllllllllll Mill lllllllllllllllll Illlllll 
TTTCAGCTCAGCAACATCATTCCTGGATTCCCAAGAGCCAAAATGTATTTTGTGCCTCCC 7626 

CCCTATGAATTGTCAGAGAGTCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAA 773 7 

lllllllll Illlllllll llilllllllllllllllllllllllllllllllllll 
CCCTATGAACTGTCAGAGAGCCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAG 76 86 

CAGACAACAGAGAGACATAACCAGGCCTTCATGGCTCTGGAAGGACAGGTCATTACTAAA 7797 

llllllllllllll llllllllllllll MMMI llllllllllllll Mill 

CAGACAACAGAGAGGCATAACCAGGCCTTTCTGGCTCTAGAAGGACAGGTCATCTCTAAA 7746 

AAGCTCCACGCCAGCATCCGAGAGAAAGCAGGTCACTGGTTTGCCACCACCACGCCCATC 7857 

Illlllll II lllllllllllllllllll lllllllllll II llllllllllll 
AAGCTCCATGCAGGCATCCGAGAGAAAGCAGGCCACTGGTTTGCTACGACCACGCCCATC 7806 

ATTGGCAAAGGCATCATGTTTGCCATCAAAGAAGGGCGGGTGACCACGGGCGTGTCCAGC 7917 

II IIIMIIIIIIIIIIII MMMMMMMMMMMMM Illlllll III 

ATCGGCAAAGGCATCATGTTCGCCATCAAAGAAGGGCGGGTGACCACAGGCGTGTCTAGC 7866 
ATCGCCAGCGAAGATAGCCGCAAGGTGGCATCTGTGCTGAACAACGCCTACTACCTGGAC 7977 

Illlllll II II lllllllllll Mill III lllllllllllllllll Mill 

ATCGCCAGTGAGGACAGCCGCAAGGTAGCATCCGTGTTGAACAACGCCTACTACTTGGAC 7926 

AAGATGCACTACAGCATCGAGGGCAAGGACACCCACTACTTTGTGAAGATTGGCTCAGCC 8037 

lllillllllllllllllllllllllllllll Illlllll Illlllll II MM 
AAGATGCACTACAGCATCGAGGGCAAGGACACACACTACTTCGTGAAGATCGGTGCAGCG 7986 

GATGGCGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGGTGCTAGAGAGCGGGGTG 8097 

II M Illlllll II II II Illlllll II MMMMMI MMMMMM 

GACGGTGACCTGGTTACGCTGGGGACCACCATTGGGCGCAAGGTGCTGGAGAGCGGGGTG 8046 
AACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAAGGTTCACGAAC 8157 

llllllllllllll lllllllllllllllll lllllllllllllllllllllll III 

AACGTGACCGTGTCACAGCCCACGCTGCTGGTGAACGGCAGGACTCGAAGGTTCACCAAC 8106 



Mill llllllllllllllllllllllllllill Mill IIIIIIIIIIIIIIIM 
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SB JCT : 8107 ATTGAATTCCAGTACTCCACX5CTGCTGCTCAGCATACGCTACGGC 



SB JCT : 


; 8107 


QUERY: 


; 8218 


SB JCT : 


; 8167 


QUERY: 


; 8278 


SB JCT : 


: 8227 


QUERY : 


; 8338 


SB JCT : 


8287 


QUERY: 


8398 


SB JCT : 


8347 


QUERY : 


8458 


SB JCT : 


8407 


QUERY : 


8518 


SB JCT : 


8467 


QUERY : 


8578 1 


SB JCT: 


8527 1 


QUERY: 


8638 . 


SB JCT : 


8587 . 


QUERY: 


8698 1 


SB JCT : 


8647 i 


SCORE 


= 1459 



IC^U^( 



CCCCCGACACA 8166 



_ fVGGCCCGCGTCCTGGACCAGGCGAGACAGAGG(SWCTGGGCACGGCC 8277 

Mill llllllllllllllllllllllllll III llllllllllllllll II III 
SBJCT: 8167 CTGGATGAAGAGAAGGCCCGCGTCCTGGACCAAGCGCGACAGAGGGCCCTGGGTACTGCC 8226 



TGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGGAGCCGCCTGTGGACTGAG 8337 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII mil llllllll III 
TGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGCAGCCGTCTGTGGACGGAG 8286 

GGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTACGAGGGATATTACGTG 83 97 

lllllillllllll II llllllll 11 II llllllll II Mill lllllllll 
GGCGAGAAGCAGCAACTCCTGAGCACGGGACGGGTGCAAGGTTATGAGGGCTATTACGTG 8346 

CTTCCCGTGGAGCAATACCCAGAGCTTGCAGACAGTAGCAGCAACATCCAGTTTTTAAGA 8457 

Mill Mill II lllllllllll lllllllllllllllllllllilMI llllll 

CTTCCGGTGGAACAGTACCCAGAGCTGGCAGACAGTAGCAGCAACATCCAGTTCTTAAGA 8406 
CAGAATGAGATGGGAAAGAGGTAACAAAATAATCTGCTGCCATTCCTTGTCTGAATGGCT 8517 

IIIIMMMIMMMMMIMMIMMI IIIIIIMI Ml MM Mill 

CAGAATGAGATGGGAAAGAGGTAACAAAATAACCTGCTGCCACCTCTTCTCTGGGTGGCT 8466 
CAGCAGGAGTAACTGTTATCTCCTCTCCTAAGGAGATGAAGACCTAACAGGGGCACTGCG 8577 

lllllllll llllll I IIIIIIMI llllllll 1 1 III III II III MM I III I 

CAGCAGGAGCAACTGTGACCTCCTCTCCTAAGGAGACGAAGACCTAACAGGGGCACTGAG 8526 

GCTGGGCTGCTTTAGGAGACCAAGTGGCAAGAAAGCTCACATTTTTTGAGTTCAAATGCT 8637 

llllllllllllllll lllllllllllllllllllllllllllllllllllllllll 
GCCGGGCTGCTTTAGGACCCCAAGTGGCAAGAAAGCTCACATTTTTTGAGTTCAAATGCT 8586 

ACTGTCCAAGCGAGAAGTCCCTCATCCTGAAGTAGACTAAAGCCCGGCTGAAAATTCCGA 8697 

Illlllllllll lllllllllllllllllllllllll III MM llllll II 
ACTGTCCAAGCGCAAAGTCCCTCATCCTGAAGTAGACTAGAGCTCGGCCACAAATTCTGA 8646 



lllllllll 



0.0 



IDENTITIES = 1081/1196 (90%) 
STRAND = PLUS / PLUS 



QUERY : 


270 


SBJCT : 


123 


QUERY : 


330 


SBJCT : 


183 


QUERY : 


390 


SBJCT : 


243 


QUERY: 


450 


SBJCT : 


303 


QUERY : 


510 


SBJCT : 


363 


QUERY: 


570 


SBJCT : 


423 


QUERY: 


630 


SBJCT : 


483 



ATCTGGAATAATGGATGTAAAGGACCGGCGACACCGCTCTTTGACCAGAGGACGCTGTGG 329 

Mill Illlllllllll Mill llllllll llllllllllllll lllll Mill 

ATCTGCAATAATGGATGTGAAGGATCGGCGACATCGCTCTTTGACCAGGGGACGGTGTGG 182 

CAAAGAGTGTCGCTACACAAGCTCCTCTCTGGACAGTGAGGACTGCCGGGTGCCCACACA 3 89 

Ml llllllllllllll lllllllllllllllllllllllllllll llllllll II 
CAAGGAGTGTCGCTACACCAGCTCCTCTCTGGACAGTGAGGACTGCCGTGTGCCCACGCA 242 

GAAATCCTACAGCTCCAGTGAGACTCTGAAGGCCTATGACCATGACAGCAGGATGCACTA 449 

ill illlllll lllllllllll llllllll llilllillllllllll llllllll 
GAAGTCCTACAGTTCCAGTGAGACCCTGAAGGCTTATGACCATGACAGCAGAATGCACTA 302 

TGGAAACCGAGTCACAGACCTCATCCACCGGGAGTCAGATGAGTTTCCTAGACAAGGAAC 509 

MMMIIMMMMMMI I lllllllllll lllllllll llllllllll I 

TGGAAACCGAGTCACAGACCTGGTGCACCGGGAGTCCGATGAGTTTTCTAGACAAGGGGC 362 
CAACTTCACCCTTGCCGAACTGGGCATCTGTGAGCCCTCCCCACACCGAAGCGGCTACTG 569 

II llllllll II III MM lllll llllllllllllllllllll II lllll 

TAATTTCACCCTGGCAGAATTGGGAATCTGCGAGCCCTCCCCACACCGAAGTGGTTACTG 422 

CTCCGACATGGGGATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTC 629 

lllllllllllllllll lllllllllllllllll lllll llllllll II lllll 
TTCCGACATGGGGATCCTCCACCAGGGCTACTCCCTGAGCACTGGGTCTGATGCGGACTC 482 

CGACACCGAGGGAGGGATGTCTCCAGAACACGCCATCAGACTGTGGGGCAGAGGGATAAA 689 

lllllllllllllllllllllllllllll lllllllllllllllll llllllllll 
GGACACCGAGGGAGGGATGTCTCCAGAACATGCCATCAGACTGTGGGGACGAGGGATAAA 542 
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QUERY: 


690 


SBJCT: 


543 


QUERY: 


750 


SBJCT : 


603 


QUERY: 


810 


SBJCT: 


663 


QUERY: 


870 


SBJCT : 


723 


QUERY: 


930 


SBJCT : 


783 


QUERY: 


990 


SBJCT : 


843 


QUERY : 


1050 


SBJCT : 


903 


QUERY: 


1110 


SBJCT : 


963 


QUERY : 


1170 


SBJCT : 


1023 


QUERY : 


1230 


SBJCT : 


1083 


QUERY : 


1290 


SBJCT : 


1143 


QUERY: 


1350 


SBJCT : 


1203 


QUERY: 


1410 


SBJCT : 


1263 


SCORE 


= 142 



ATCCAGGCGCAGTTCCGGCCTGTCCAGTCGTGAAAACTCGGCCCT 

III llllllll III lllllll II II Mill Mill 

ATCGAGGCd^B^TCTGGCTTGTCCAGCCGCGAGAACTCAGCCCT' 



:t^c< 



.cctgactgactc 

MM M 
ctgactgattc 



tgacaacgaaaacaaatcagatgatgagaacggtcgtcccattccacctacatcctcgcc 

MMM MMI MMI MMI M M MMI MMMMMMMMMMI I 
tgacaatgaaaataaatcggatgacgacaatggtcgacccattccacctacatcctcgtc 

tagtctcctcccatctgctcagctgcctagctcccataatcctccaccagttagctgcca 

Ml MMMMMMMMMMMMMIMMIMMMMMMMMMMMM 
tagcctcctcccatctgctcagctgcctagctcccataatcctccaccagttagctgcca 

gatgccattgctagacagcaacacctcccatcaaatcatggacaccaaccctgatgagga 

MMMMMMMMMMMMMMMMI MMMMMMMMI MMMM 
gatgccattgctagacagcaacacctcccatcagatcatggacaccaaccccgatgagga 

attctcccccaattcatacctgctcagagcatgctcagggccccagcaagcctccagcag 

lllllllll IMIIIIIIIIIIIIIIIIIIIIIIIIIIIMillllllllllllll M 

attctcccctaattcatacctgctcagagcatgctcagggccccagcaagcctccagtag 



749 



602 



809 



662 



869 



722 



929 



782 



989 



842 



TGGCCCTCCGAACCACCACAGCCAGTCGACTCTGAGGCCCCCTCTCCCACCCCCTCACAA 104 9 

iiiiiiiiiiiiiiiiiiiiiiiiiii II llllllllllllll mil Mill II 

TGGCCCTCCGAACCACCACAGCCAGTCAACGCTGAGGCCCCCTCTGCCACCTCCTCATAA 902 
CCACACGCTGTCCCATCACCACTCGTCCGCCAACTCCCTCAACAGGAACTCACTGACCAA 1109 

mm llllllll llllllll II lllllllllllllllll llllllllllllll 

CCACACCCTGTCCCACCACCACTCCTCTGCCAACTCCCTCAACAGAAACTCACTGACCAA 962 
TCGGCGGAGTCAGATCCACGCCCCGGCCCCAGCGCCCAATGACCTGGCCACCACACCAGA 1169 

MMMMIMI MMMMMI M M M MMMMMMMMMM M M 

TCGGCGGAGTCAAATCCACGCCCCAGCTCCTGCACCCAATGACCTGGCCACCACGCCGGA 1022 
GTCCGTTCAGCTTCAGGACAGCTGGGTGCTAAACAGCAACGTGCCACTGGAGACCCGGCA 1229 

MMMMMM MMMMMMMMI Mill llllllll IIIIIMI Mill 

GTCCGTTCAGCTCCAGGACAGCTGGGTGCTGAACAGTAACGTGCCGCTGGAGACGCGGCA 1082 
CTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCTCTTCCCCGGGATA 12 89 

iiiiiiiiiiiiiii Mill II lllllllll iiiiiiiiiiiiiiii II mil 

CTTCCTCTTCAAGACGTCCTCCGGAAGCACACCCCTGTTCAGCAGCTCTTCTCCAGGATA 1142 
CCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTT 134 9 

Ml lllllllllll II Mill II II M IIMIIIIIIIIII IIMIII II 

CCCCTTGACCTCAGGGACCGTTTATACACCACCACCCCGCCTGCTGCCACGGAATACATT 1202 
CTCCAGGAAGGCTTTCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGC 14 09 

III llllllll llllllllllllll lllllllllllllllll llllllll II II 

CTCTAGGAAGGCCTTCAAGCTGAAGAAACCCTCCAAATACTGCAGTTGGAAATGCGCCGC 1262 
CCTCTCCGCCATTGCCGCGGCCCTCCTCTTGGCTATTTTGCTGGCGTATTTCATAG 1465 

III II lllllllllll llllllll MM iiiimiiii llllllllll 

CCTGTCTGCCATTGCCGCTGCCCTCCTTCTGGCCATTTTGCTGGCCTATTTCATAG 1318 



IDENTITIES = 996/1088 (91%) 
STRAND = PLUS / PLUS 



QUERY: 


1464 


SBJCT : 


1440 


QUERY: 


1524 


SBJCT : 


1500 


QUERY: 


1584 


SBJCT : 


1560 


QUERY: 


1644 



AGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCG 1523 

IIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIII II II llllllll Mill II 

AGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGCGGCGAGGCAGAAGTCGGTCGACG 14 99 
GGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCC 1583 

III mil iiiiiiiiiiiiiiiiiiiiiiiiiiiii II lllllllllllllllll 

GGTGACACAGGAAGTCCCACCAGGGGTGTTTTGGAGGTCCCAGATTCACATCAGTCAGCC 1559 
CCAGTTCTTAAAGTTCAACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAG 1643 

llllllllllllllllllllllllll llllllll II Mill II 11 llllllll 

TCAGTTCTTAAAGTTCAACATCTCCCTGGGGAAGGATGCCCTCTTCGGCGTCTACATAAG 1619 
AAGAGGACTTCCACCATCTCATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGA 1703 

lllllllll llllllllllllll iiiiiiiiiiiiiiiiiiii llllllll mil 
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SB JCT : 


1620 


QUERY: 


1704 


SB JCT : 


1680 


QUERY: 


1764 


SB JCT : 


1740 


QUERY: 


1824 


SB JCT : 


1800 


QUERY: 


1884 


SB JCT: 


1860 


QUERY: 


1944 


SB JCT: 


1920 


QUERY: 


2004 


SB JCT : 


1980 


QUERY: 


2064 


SB JCT : 


2040 


QUERY: 


2124 


SB JCT : 


2100 


QUERY: 


2184 


SB JCT : 


2160 


QUERY : 


2244 


SB JCT : 


2220 


QUERY: 


2304 


SB JCT : 


2280 


QUERY: 


2364 


SB JCT: 


2340 


QUERY: 


2424 


SB JCT: 


2400 


QUERY: 


2484 


SB JCT : 


2460 


QUERY: 


2544 


SB JCT: 


2520 



AAGAGGACTGCCACCATCTCATGCACAGTATGACTTCATGGAACG" 



GAAGTGGAcI^RgtTGAGTCTCCCAGGGAACGCCGGAGCATACAS^PTTGGTTCAGAA 1763 

IIMIIMIlTllli Mill IIIIIIIMIMIIIIIIil lIMM llll III!! 

GAAGTGGAGTGTGGTCGAGTCACCCAGGGAACGCCGGAGCATCCAGACCCTGGTGCAGAA 173 9 
TGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGA 1823 

II M Mill MIMIIII IIIMIIIIIIIIIIIIII II llllllllllllll 

CGAGGCTGTGTTCGTGCAGTACTTGGATGTGGGCCTGTGGCACCTCGCCTTCTACAATGA 1799 
TGGAAAAGACAAAGAGATGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTG 1883 

II II Mill IIIIIIM IIIIIIIIIII llllll Mlllllllllllllllll 

CGGCAAGGACAAGGAGATGGTCTCCTTCAATACGGTTGTCTTAGATTCAGTGCAGGACTG 1859 
TCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGG 1943 

llllll llllllll IIMI II Mill Mill II lllllllllllllllllll 

TCCACGAAACTGCCACGGGAACGGCGAATGCGTGTCTGGACTGTGTCACTGTTTCCCAGG 1919 
ATTTCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGGACA 2003 

III Mill llllllll IIIIIIIIIIIIIIIMIM llllllllllllllllllll 

ATTCCTAGGTGCAGACTGCGCTAAAGCTGCCTGCCCTGTTCTGTGCAGTGGGAATGGACA 1979 
ATATTCTAAAGGGACGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCC 2063 

Mill llllllll IIIIIIIIMIIII IIIIIIIIIII Mill Mill Mill 

GTATTCCAAAGGGACATGCCAGTGCTACAGTGGCTGGAAAGGAGCAGAATGCGATGTGCC 2039 
CATGAATCAGTGCATCGATCCTTCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTG 2123 

llllll IlillllMIIIIIIIIIII IIIIIIIIIIIIIIIIIIIIIIIMIIIMII 

CATGAACCAGTGCATCGATCCTTCCTGTGGGGGCCACGGCTCCTGCATTGATGGGAACTG 2099 
TGTCTGCTCTGCTGGCTACAAAGGCGAGCACTGTGAGGAAGTTGATTGCTTGGATCCCAC 2183 

II II I IIIIIIIIIII IIIIIIIIIII II Mill IIIIIIIIMIIII II 

CGTGTGTGCAGCTGGCTACAAGGGCGAGCACTGCGAAGAAGTGGATTGCTTGGATCCAAC 2159 
CTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAATGCCTGTGCAGCCCTGGCTGGGGTGG 2243 

IIIIIIIIIIII II IIIIIIIIMI Mill M II llllllll llllllll M 

CTGCTCCAGCCATGGTGTCTGTGTGAACGGAGAGTGTCTATGCAGCCCCGGCTGGGGCGG 2219 
TCTGAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTA 23 03 

II Mill llllllilMMIIIIIIIIIIIIIIIIIIII llllllllllllll II 

GCTCAACTGCGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGTAGTGGGCATGGCACTTA 2279 
CCTGCCTGACACGGGCCTCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGT 23 63 

III llllll I IIIIIIIIM III Mill II lllllllllllllllllllllll 

CCTCCCTGACTCTGGCCTCTGCAACTGTGATCCGAATTGGATGGGTCCCGACTGCTCTGT 233 9 
TGAAGTGTGCTCAGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTG 2423 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIilllllllllll 

TGAAGTGTGCTCAGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTG 23 99 
TGAAGAGGGCTGGACAGGCGCAGCGTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGA 24 83 

lllllllllllllllllilll II lllllllllllllllllllllllllllllllllll 

TGAAGAGGGCTGGACAGGCGCGGCTTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGA 2459 
GCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTG 2543 

II llll llllllll llllll II II MM IIIIIIIIIIII llllll MIMIIII II III 

GCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTG 2 519 
CACCATTG 2551 

llllllll 

CACCATTG 2527 



;^gGG. 

^IBcT' 



lACGGAAAGGA 1679 



In this search it was also found that the FCTR3bcd and e nucleic acid had homology to 
six fragments of Gallus gallus partial mRNA for teneurin-2. It has 2780 of 3449 bases (80%) 
identical to bases 3386-6834, 1553 of 1862 bases (83%) identical to bases 1414-3275, 540 of 
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628 bases (85%) identi^p bases 587-1214, 593 of 725 bases {m^ identical to bases 7084- 
7808, 429 of 515 bases^W/o) identical to bases 7895-8409, and^Fof 475 bases (83%) 



identical to bases 20-494 of Gallus gallus partial mRNA for teneurin-2. (EMBL Acc: 
GGA278031) (Table 30). 

Table 30. BLASTN of FCTR3b, c, d, and e against Gallus gallus Teneurin-2 mRNA (SEQ 

ID NO:67) 

> GI 1 10241573 IEMB|AJ279031. l|GGA279031 GALLUS GALLUS PARTIAL MRNA FOR TENEURIN-2 (TEN2 
GENE) , LONG SPLICE 
VARIANT 
LENGTH = 8409 



SCORE = 1532 BITS (773), EXPECT 
IDENTITIES = 2780/3449 (80%) 
STRAND = PLUS / PLUS 



0.0 



QUERY: 


3458 


SB JCT : 


3386 


QUERY: 


3518 


SB JCT : 


3446 


QUERY: 


3578 


SB JCT : 


3506 


QUERY: 


3638 


SB JCT : 


3566 


QUERY : 


3698 


SB JCT : 


3626 


QUERY : 


3 758 


SB JCT : 


3686 


QUERY: 


3818 


SB JCT : 


3746 


QUERY: 


3878 


SB JCT: 


3806 


QUERY : 


3938 


SB JCT: 


3866 


QUERY: 


3998 


SB JCT : 


3926 


QUERY: 


4058 


SBJCT: 


3986 


QUERY: 


4118 


SBJCT: 


4046 



TGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGG 3517 

Illllll II II II llllllll II II II Mill I III lllllllll III 

TGATGGTAGCAGTAGAAGGGCATCTATTTCAAAAATCATTTCTGGCATCTCCCAACTTGG 3445 

CCTCCACCTTCATCTGGGACAAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAG 3577 

I I II llllllllllllll llllllll Mill II I III Mill I MM 

CTTATACATTCATCTGGGACAAAACAGATGCATATGGTCAGAAGGTTTATGGGTTGTCAG 3505 
ATGCTGTTGTGTCTGTCGGGTTTGAATATGAGACCTGTCCCAGTCTAATTCTCTGGGAGA 3637 

Illllll II Mill II llllllllllllll lllllllll I Mill Illllll 

ATGCTGTAGTTTCTGTGGGTTTTGAATATGAGACTTGTCCCAGTTTGATTCTGTGGGAGA 3565 
AAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCC 3697 

Illllll II II II II Mill Mill II M Mill II II II Mill 

AAAGGACTGCGCTGCTGCAAGGATTTGAGCTAGATCCTTCCAATCTAGGAGGATGGTCTT 3625 
TAGACAAACACCACATCCTCAATGTTAAAAGTGGAATCCTACACAAAGGCACTGGGGAAA 3757 

I II lllll II I II I II MM Mill II I llllllllll III MM 

TGGATAAACATCATGTACTGAATGTCAAGAGTGGTATATTGCACAAAGGCAATGGAGAAA 3685 



I lllll II II Mllllll II I II llllllll lllll IIIM lllll 



GGAGCATTTCCTGTCCCAGCTGCAACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAG 3 877 

I lllll llllllll llllllll II lllll lllll II lllll Illllll I 

GAAGCATATCCTGTCCTAGCTGCAATGGTCTTGCAGAAGGAAATAAGCTTTTGGCCCCTG 3 805 

TGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGACTTCAATTACATCCGAC 3 937 

I II lllll II lllll lllll Illllll III II II II llllllll II I 

TAGCACTGGCAGTGGGAATTGATGGAAGCCTCTTTGTTGGAGATTTTAATTACATTCGGC 3 865 

GCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGAAATAAAGAGTTTAAAC 3 997 

I lllll II II I llllllll Mill IIIMI I llllllllllllllllll 

GTATCTTCCCATCCAGGAATGTGACTAGCATATTGGAGCTGAGAAATAAAGAGTTTAAAC 3 925 

ATAGCAACAACCCAGCACACAAGTACTACTTGGCAGTGGACCCCGTGTCCGGCTCGCTCT 4057 

llllllllll II M IIIM IIIM MM IIIIIIIMII II lllll II I 

ATAGCAACAATCCTGCTCACAAATACTATCTGGCCGTGGACCCCGTTTCGGGCTCCCTGT 3985 

ACGTGTCCGACACCAACAGCAGGAGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAG 4117 

MM II IIIIIIIMIII I I II III lllll Mill I III II MM 

ACGTATCAGACACCAACAGCCGACGGATATACAAAGTCAAATCTCTTACTGGCACGAAAG 4045 

ACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATG 4177 

llllllllll lllll lllll II II lllll llllllll II II llllllllll 

ACCTGGCTGGTAATTCTGAAGTGGTAGCGGGGACTGGAGAGCAATGCCTGCCCTTTGATG 4105 
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4178 AAGCCCGCTGCGGGGATGGAGGGAAGGCCATAGATGCAACCCTGA- 

• IMIMMIII M I II llllllll I 
AGATGGAGGGAAAGCAGTGGACGCAACCCTAA' 



fCCCGAGAGGTA 4237 

II MM I 

rTCCTCGAGGAA 4165 



QUERY: 
SBJCT: 

QUERY: 4238 TTGCAGTAGACAAGAATGGGCTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTG 42 97 

Illllll II III MM llllllll lllll MIIIIM IIIM II II II I 

SBJCT: 4166 TTGCAGTGGATAAGTATGGACTCATGTATTTTGTTGATGCCACTATGATTCGAAAAGTGG 4225 

QUERY: 42 98 ACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAATGACCTCACTGCCGTCCGGCCGC 4357 

I IIIIMIIIII 11 II II llllllllllilllllllll Illllllllll II I 
SBJCT: 4226 ATCAGAATGGAATTATATCAACTCTGCTGGGCTCCAATGACCTAACTGCCGTCCGACCTC 4285 

QUERY: 4358 TGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGAGTGGCCAACAGACCTTG 4417 

I IIIIMIIIIIIIIIIIIIIIII IIIMI II IIIIIIIMII II 11 II I 
SBJCT: 42 86 TAAGCTGTGATTCCAGCATGGATGTCAGCCAGGTACGGCTGGAGTGGCCTACTGATCTCG 434 5 

QUERY: 4418 CTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATCA 4477 

lllll llllllilll Mill I IIIM Mllllllllllll II I II MM 

SBJCT: 4346 CTGTCGATCCCATGGACAACTCACTTTATGTCCTAGAGAACAATGTTATTTTACGGATCA 4405 

QUERY: 4478 CCGAGAACCACCAAGTCAGCATCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCA 4537 

I II Mill lllll Mill lllll IIIIIIIIMIIIIIIIIII llllllll I 
SBJCT: 4406 CAGAAAACCATCAAGTTAGCATTATTGCTGGACGCCCCATGCACTGCCAGGTTCCTGGTA 4465 

QUERY: 4 53 8 TTGACTACTCACTCAGCAAACTAGCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTG 4597 

I IIIIMII M llllllll II lllll II II II II IIIIMIIIIIIIIII 

SBJCT: 4466 TAGACTACTCTCTTAGCAAACTGGCTATTCATTCCGCACTTGAATCAGCCAGTGCCATTG 4525 
QUERY: 4 598 CCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATGAGAAGAAGATTAACCGTC 4657 

MM M lllll II II II Illllll llllllllllll II II lllll II I 

SBJCT: 4 526 CCATCTCACACACAGGAGTTCTTTACATCAGTGAGACAGATGAAAAAAAAATTAATCGGC 4585 

QUERY : 4658 TACGCCAGGTAACAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCG 4717 

lllllllllllll lllll II II II llllll I II llllllll II lllll I 
SBJCT: 4586 TACGCCAGGTAACTACCAATGGAGAAATATGCCTTCTTGCAGGGGCAGCTTCAGACTGTG 4645 

QUERY: 4718 ACTGCAAAAACGATGTCAATTGCAACTGCTATTCAGGAGATGATGCCTACGCGACTGATG 4777 

I llllllll MIIIIM II II llllllll II Illllll II II Illllll 
SBJCT: 464 6 ATTGCAAAT^TGATGTCAACTGTAATTGCTATTCTGGGGATGATGGGTATGCCACTGATG 4 705 

QUERY: 4778 CCATCTTGAATTCCCCATCATCCTTAGCTGTAGCTCCAGATGGTACCATTTACATTGCAG 4837 

Illllll lllll lllll Illllllllll II Mllllllllllll lllll II I 
SBJCT: 4706 CCATCTTAAATTCACCATCTTCCTTAGCTGTGGCCCCAGATGGTACCATCTACATAGCTG 4 765 

QUERY: 4 83 8 ACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAACAAGCCTGTTCTTAATGCCTTCA 4 897 

I II llllllll II II lllll IIIM II MM III llllllll I II I 
SBJCT: 4 766 ATCTCGGAAATATCCGCATTAGGGCTGTCAGTAAAAACAGGCCCATTCTTAATTCTTTTA 4 825 

QUERY: 4 898 ACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCA 4 957 

MM lllll lllllill II lllll llllll I lllll lllll Mllllll I 
SBJCT: 4 826 ACCAATATGAAGCTGCATCTCCAGGAGAACAGGAGCTGTATGTCTTCAATGCTGATGGGA 4 885 

QUERY: 4 958 TCCACCAATACACTGTGAGCCTGGTGACAGGGGAGTACTTGTACAATTTCACATATAGTA 5017 

I lllll llllll I lllll II II IIIIIIIIIIIIIIIIIIIIIM lllll I 
SBJCT : 4 886 TTCACCAGTACACTCTCAGCCTTGTTACCGGGGAGTACTTGTACAATTTCACCTATAGCA 4 945 

QUERY: 5018 CTGACAATGATGTCACTGAATTGATTGACAATAATGGGAATTCCCTGAAGATCCGTCGGG 5077 

III II llllllll II MM MM lllll II III lllll lllllllll 

SBJCT: 4946 GTGATAACGATGTCACCGAGGTGATGGACAGCAATGGCAACTCCTTGAAGGTCCGTCGGG 5005 
QUERY : 5078 ACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGACAACCAGATCATCACCCTCACCG 5137 

I III M Mllllll II I II llllllll II lllll MM II III 

SBJCT: 5006 ATGCCAGCGGAATGCCCCGCCATTTACTGATGCCTGATAATCAGATTGTCACGCTGGCCG 5065 

QUERY: 5138 TGGGCACCAATGGAGGCCTCAAAGTCGTGTCCACACAGAACCTGGAGCTTGGTCTCATGA 5197 

i lllll lllll 11 llllll I II II II MM llllll lllll I Mil 
SBJCT: 5066 TTGGCACTAATGGTGGACTCAAACTAGTCTCAACGCAGACCCTGGAACTTGGATTAATGA 5125 

QUERY: 5198 CCTATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTT 5257 
I III I II MM III III I II II lllll IIIIIIIIIIMIIIII II I 

68 15966-697 







SB JCT : 


5126 






QUERY: 


5258 




C 
J 


SB JCT : 


5186 






QUERY: 


5318 






SB JCT : 


5246 




10 










QUERY: 


5378 






SB JCT : 


5306 




15 


QUERY: 


5438 






SB JCT : 


5366 






QUERY: 


5498 




20 










SB JCT : 


5426 






QUERY: 


5558 




25 


SB JCT : 


5486 






QUERY: 


5618 














SB JCT : 


5546 


ill 


30 












D O / O 














SB JCT : 


5606 










-pell 


3S 


yuriKi : 


3 / J O 


;?! 












SB JCT : 


5666 














QUERY: 


5798 


rsT 


40 






U.l 




SB JCT : 


5726 


! f? 




QUERY: 


5858 






SB JCT : 


5786 






QUERY : 


5918 






SB JCT : 


5846 




50 










QUERY: 


5978 






SB JCT : 


5906 




r c 
JJ 


QUERY: 


6038 






SB JCT : 


5966 






QUERY: 


6098 




60 










SB JCT : 


6026 






QUERY : 


6158 






SB JCT: 


6086 






QUERY: 


6218 






SB JCT: 


6146 



5126 CTTATAACGGAAACAGTGGTCTCTTAGCAACGAAGAGTGATGAAACAfiGATGGACAACAT 5185 



imTTi 



:lBiG 



.CCACGAAGGCCGCCTGACCAACGTGACGCGCCllPCGGGGGTGGTAA 5317 

I llllllirTTi II Mill lllllllllll II II II Mill II Mllill 

5186 TTTATGACTATGATCATGAAGGGCGCCTGACCAATGTAACACGTCCCACTGGAGTGGTAA 5245 



CCAGTCTGCACCGGGAAATGGAGAAATCTATTACCATTGACATTGAGAACTCCAACCGTG 5377 

I II II II II llllllll II Mlllllllll IIIIMIIIII II II II I 
CTAGCCTTCATCGAGAAATGGAAAAGTCTATTACCATCGACATTGAGAATTCTAATCGGG 5305 

ATGATGACGTCACTGTCATCACCAACCTCTCTTCAGTAGAGGCCTCCTACACAGTGGTAC 5437 

Illllll Mill llllllll II Mill II II Mill Mill Mill II I 

ATGATGATGTCACGGTCATCACAAATCTCTCCTCTGTGGAGGCTTCCTATACAGTTGTTC 5365 
AAGATCAAGTTCGGAACAGCTACCAGCTCTGTAATAATGGTACCCTGAGGGTGATGTATG 5497 

llllllllll IIIIMIIIIIIIMIIIIIIIIIIIIIIII MM llllllllll 

AAGATCAAGTGAGGAACAGCTACCAGCTCTGTAATAATGGTACTTTGAGAGTGATGTATG 5425 
CTAATGGGATGGGTATCAGCTTCCACAGCGAGCCCCATGTCCTAGCGGGCACCATCACCC 5557 

I Mill III MM Mill llllllll II llllllll II II II I II I 

CCAATGGCATGAGTATTAGCTTTCACAGCGAACCTCATGTCCTGGCTGGGACAGTAACTC 54 85 
CCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAACTCCATTGAGT 5617 

Mill 1 1 Mill II II II II II II IIIIMIIIII II Mill Mill I 

CCACCATAGGACGATGTAATATTTCTCTACCAATGGAGAATGGTTTGAACTCAATTGAAT 5545 



GGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCCGGG 

MM II II II IIIIIIIIIIIIIIIIIIM II I Mill II llllll III 

GGCGTCTGAGGAAAGAACAGATTAAAGGCAAAGTGACTGTGTTTGGAAGAAAGCTCAGGG 



5677 



5605 



TCCATGGAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGACTGAAAAGATCT 5737 

I llllllll III I llllllllll II M II Mill II II Mill MM 

TTCATGGAAGGAATTTGCTGTCCATTGATTACGACCGGAATATACGCACAGAAAAAATCT 5665 
ATGATGACCACCGGAAGTTCACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCC 5797 

I Mill Mill lllllllllllllllll MM! II III MM II Illllll 

ACGATGATCACCGCAAGTTCACCCTGAGGATAATTTACGATCAGCTGGGACGGCCCTTCC 5725 



TCTGGCTGCCCAGCAGCGGGCTGGCAGCTGTCAACGTGTCATACTTCTTCAATGGGCGCC 

lllllllllllllllllll Mill II lllllllllll II llllllll Illllll 
TCTGGCTGCCCAGCAGCGGCCTGGCTGCCGTCAACGTGTCCTATTTCTTCAACGGGCGCC 



5857 



5785 



70 



TGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAGACATCGACAAGCAAGGCCGCA 5917 

II II II III Mill II II lllllllllll llllllllllllllllllllllll I I 
TGGCTGGGCTTCAGCGCGGAGCCATGAGCGAAAGGACAGACATCGACAAGCAAGGCAGGA 5845 

TCGTGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCTTGACAAGTCCA 5977 

M I II llllllll II II IIIM II Mill III Illllll II II MM 

TCATATCGCGCATGTTTGCAGATGGGAAGGTTTGGAGTTACACCTACCTAGAAAAATCCA 5905 
TGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGAGTATGACTCCTCTGACCGCC 6 037 

MM II llllllllllllll II Mill II lllllllllll M II Mill I 

TGGTACTACTGCTTCAGAGCCAGCGGCAGTACATCTTTGAGTATGATTCTTCAGACCGGC 5965 
TCCTTGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACACACCTCCATCG 6097 

III III II II Mill II II II Mill llllllll M Mill II I I 

TCCATGCTGTTACTATGCCTAGTGTTGCTCGGCATAGCATGTCAACTCACACGTCTGTTG 6025 
GCTACATCCGTAATATTTACAACCCGCCTGAAAGCAATGCTTCGGTCATCTTTGACTACA 6157 

Illllll I IIIIIIM M II lllllllllll II II II M IIIM MM 

GCTACATTAGGAATATTTATAATCCTCCTGAAAGCAACGCATCAGTGATTTTTGATTACA 6085 
GTGATGACGGCCGCATCCTGAAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTACA 6217 

Illllll II I II MM II II IIIM II II II II II II II MM 

GTGATGATGGGAGGATTTTGAAAACATCATTTTTAGGTACTGGTCGACAAGTCTTTTACA 6145 

AGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTACGACAGTACCGCCGTCACCTTCG 6277 

Illllll II II Mill Mill II Mill II llllllll II II II M I 
AGTATGGAAAGCTATCCAAATTATCTGAAATTGTTTATGACAGTACTGCGGTTACTTTTG 6205 
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QUERY: 


6278 


SB JCT : 


6206 


QUERY: 


6338 


SB JCT : 


6266 


QUERY: 


6398 


SB JCT : 


6326 


QUERY : 


6458 


SB JCT : 


6386 


QUERY: 


6518 


SB JCT: 


6446 


QUERY: 


6578 


SB JCT : 


6506 


QUERY: 


6638 


SB JCT : 


6566 


QUERY: 


6698 


SB JCT : 


6626 


QUERY : 


6758 


SB JCT : 


6686 


QUERY: 


6818 


SB JCT : 


6746 


QUERY : 


6878 


SB JCT : 


6806 



GGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAI 

I Mill Ai II llllll llllllll II I Mill 
GATATGATd^BcTACAGGTGTCCTAAAAATGGTGAATTTGCAAA) 



IGGGCTTCTCCT 63 3 7 

II II II I 

lAGGATTTTCTT 6265 



GCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCCG 63 97 

I II III I II II II lllillll II II lllll II IIIMIII lllll I 
GTACAATCCGCTATCGTAAAATTGGCCCTCTTGTTGACAAACAAATCTACAGATTCTCTG 6325 

AGGAAGGCATGGTCAATGCCAGGTTTGACTACACCTATCATGACAACAGCTTCCGCATCG 64 57 

I lllll lllllllllll llllllll II II lllll Mill II II Mill I 
AAGAAGGTATGGTCAATGCAAGGTTTGATTATACATATCACGACAATAGTTTTCGCATTG 6385 

CAAGCATCAAGCCCGTCATAAGTGAGACTCCCCTCCCCGTTGACCTCTACCGCTATGATG 6517 

IIIIIMIII Ml MIIIMMMMIM II II Mill II lllll lllllll 

CAAGCATCAAACCCATCATAAGTGAGACTCCTCTTCCAGTTGATCTTTACCGTTATGATG 6445 
AGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCATCTATTATGACATCAACC 6577 

Illllllllllll II II II lllll II llllllll II llllllll II II I 

AGATTTCTGGCAAAGTTGAGCATTTTGGCAAATTTGGAGTTATTTATTATGATATAAATC 6505 

AGATCATCACCACTGCCGTGATGACCCTCAGCAAACACTTCGACACCCATGGGCGGATCA 663 7 

I II II II II II II lllll M II M lllll II lllll II II II I 
AAATTATTACTACAGCAGTTATGACACTGAGTAAGCACTTTGATACCCACGGACGCATTA 6565 

AGGAGGTCCAGTATGAGATGTTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACA 6697 

I II II II llllllllllllll lllll llllllllllllll illllllllllll 
AAGAAGTTCAATATGAGATGTTCCGATCCCTGATGTACTGGATGACTGTGCAATATGACA 6625 

GCATGGGCAGGGTGATCAAGAGGGAGCTAAAACTGGGGCCCTATGCCAATACCACGAAGT 6757 

lllllll II II I II II II II lllll lllll llllllll II II MM 
GCATGGGAAGAGTAACTAAAAGAGAACTGAAACTTGGGCCGTATGCCAACACAACCAAGT 6685 

ACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGCCGTCAATGACCGCCCGA 6817 

I llllilll II lllll II lllll I II lllll II II lllll I II I 
ATACCTATGATTATGATGGAGATGGGCAATTGCAAAGCGTAGCAGTAAATGATAGGCCTA 674 5 

CCTGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTG 6877 

lllllll lllll llllllll lllll lllll III I lllll II II lllllll 
CCTGGCGTTACAGTTATGACCTGAATGGAAATCTTCACCTCCTGAATCCTGGAAACAGTG 6805 

TGCGCCTCATGCCCTTGCGCTATGACCTC 6906 

I II I llllll lllllll llllll 
TTCGATTGATGCCCCTGCGCTACGACCTC 6834 



SCORE = 1241 BITS (626), EXPECT 
IDENTITIES = 1553/1862 (83%) 
STRAND = PLUS / PLUS 



0.0 



QUERY: 


1486 


SB JCT : 


1414 


QUERY: 


1546 


SB JCT : 


1474 


QUERY: 


1606 


SB JCT : 


1534 


QUERY: 


1666 


SB JCT : 


1594 


QUERY: 


1726 


SB JCT : 


1654 


QUERY: 


1786 



AGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCGGGTAACACAAGAAGTCCCACCA 1545 

lllllllllll lllll III Illlllllll II III II lllll M II II 
AGCAGCATAGATAGTGGAGAAACAGAAGTTGGCCGCAAGGTCACCCAAGAGGTGCCCCCT 1473 

GGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCCCCAGTTCTTAAAGTTCAACATC 1605 

II lllll III MM II II II lllll lllll llllll I lllllllllll 
GGAGTGTTCTGGCGGTCTCAGATCCATATCAGCCAGCCACAGTTCCTGAAGTTCAACATA 1533 

TCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCAT 1665 

lllll llllllll lllll II llllllll llllllllllllll llllllll III 
TCCCTAGGGAAGGATGCTCTTTTCGGTGTTTATATAAGAAGAGGACTCCCACCATC:ACAT 1593 

GCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCT 172 5 

II llllllll lllllllllll MM lllll Mill illllllllll II II 

GCACAGTATGATTTCATGGAACGCTTGGATGGGAAAGAGAAATGGAGTGTGGTGGAATCC 1653 
CCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTAC 1785 

M lllllll II 11 il lllll I IMMIIMM II llllllll llllll 

CCACGGGAACGGCGAAGTATTCAGACTCTTGTTCAGAATGAGGCTGTGTTTGTTCAGTAC 1713 
CTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGATGGAAAAGACAAAGAGATGGTT 1845 

Illlllllll lllllll IMM II MIIMIMM 11 IMMIII MM 

70 15966-697 



SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT : 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY : 
SBJCT: 
QUERY: 
SBJCT : 
QUERY : 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 



1714 TTGGATG' 



TGGGTT 



TTTGTGGCACCTGGCGTTTTACAATGATGGCAAt 



lG^^AA( 

\TmKco 



.GAAGTGGTC 1773 



1846 TCCTTCAAll^pGTTGTCCTAGATTCAGTGCAGGACTGTCCACGTJWPrGCCATGGGAAT 

IIIIMI III III I I lllllllllll llllllllllllll II lllll III 
1774 TCCTTCAGTACAGTTATTTTGGATTCAGTGCAAGACTGTCCACGTAATTGTCATGGCAAT 



1905 



1833 



1906 GGTGAATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCT 1965 

II M lllll II II II II llllllll II Illllll llllllll llllll 
1834 GGCGAGTGTGTTTCTGGTGTCTGCCACTGTTTTCCCGGATTTCATGGAGCAGATTGTGCT 1893 

1966 AAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGGACAATATTCTT^GGGACGTGCCAG 2025 

IIIIMIMIIIII M llllillllll Mill II II II lllll M III I 

1894 AAAGCTGCCTGCCCGGTGCTGTGCAGTGGCAATGGTCAGTACTCCAAAGGAACCTGCTTG 1953 

2026 TGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCCCATGAATCAGTGCATCGATCCT 2085 

IIIMIII llllllllllll I II II M II illll I lllll II Mill 
1954 TGCTACAGTGGCTGGAAAGGTCCGGAATGTGATGTACCCATCAGCCAGTGTATTGATCCC 2013 



2086 



2145 



TCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTGTGTCTGCTCTGCTGGCTACAAA 

II II II II II II llllllll II llllllllllllll II MUM III 
2014 TCGTGTGGAGGTCATGGTTCCTGCATCGAAGGGAACTGTGTCTGTTCCATTGGCTATAAA 2073 

2146 GGCGAGCACTGTGAGGAAGTTGATTGCTTGGATCCCACCTGCTCCAGCCACGGAGTCTGT 2205 

II II II II llllll Mlllll II II I lllll II Illllll lllll Mini 

2074 GGAGAAAACTGTGAGGAAGTTGATTGCTTAGATCCAACATGCTCCAATCACGGGGTCTGT 2133 



2206 



2265 



GTGAATGGAGAATGCCTGTGCAGCCCTGGCTGGGGTGGTCTGAACTGTGAGCTGGCGAGG 

lllll llllllll II llllllll lllllllllll I IIIIMIIIII I II 
2134 GTGAACGGAGAATGTCTCTGCAGCCCAGGCTGGGGTGGAATAAACTGTGAGCTTCCCAGA 2193 

2266 GTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTACCTGCCTGACACGGGCCTCTGC 2325 

I lllllllllllllllllllllllllllllllll llllll IIIIMI II Mill 
2194 GCCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACATACCTGTCTGACACCGGTCTCTGT 2253 

2326 AGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGTTGAAGTGTGCTCAGTAGACTGT 23 85 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII llllllllllllll IIIIIIMI 
2254 AGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCCGTTGAAGTGTGCTCTGTAGACTGT 2313 

2386 GGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTGTGAAGAGGGCTGGACAGGCGCA 2445 

lllll II II II lllll II lllll II lllllllllll II llllllll I 
2314 GGCACCCATGGGGTGTGCATTGGCGGAGCGTGTCGCTGTGAAGAAGGGTGGACAGGAGTG 23 73 

244 6 GCGTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGAGCACGGGACCTGTAAAGATGGC 2505 

llllllllllllll lllll II lllll II I llllllll II lllllllllll 
2374 GCGTGTGACCAGCGTGTGTGTCATCCCCGGTGTACAGAGCACGGAACTTGTAAAGATGGG 2433 

2506 AAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTGCACCATTGGTAGGCAAACGGCA 2565 

llllllllllll llllllllllllllll II llllllllllllllllllllllll II 
2434 AAATGTGAATGCAGAGAGGGCTGGAATGGGGAGCACTGCACCATTGGTAGGCAAACGACA 24 93 

2 566 GGCACCGAAACAGATGGCTGCCCTGACTTGTGCAACGGTAACGGGAGATGCACACTGGGT 2625 

lllllllllllllllllllllllllllllllllll II llllllll lllll lllll 
24 94 GGCACCGAAACAGATGGCTGCCCTGACTTGTGCAATGGCAACGGGAGGTGCACGCTGGGC 2553 

2626 CAGAACAGCTGGCAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCC 2685 

llllllllllllllllilllllllllllllllllllllllllll llllllllllillll 
2554 CAGAACAGCTGGCAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCTGGATGCAACGTTGCC 2613 

2686 ATGGAAACTTCCTGTGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGATTGTTTG 2745 

IMIIIII llllllll llllllllllllll llllllllllll MM II II I 
2614 ATGGAAACCTCCTGTGCCGATAACAAGGATAACGAGGGAGATGGCTTGGTTGACTGCCTA 2673 

274 6 GACCCTGACTGCTGCCTGCAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGTCCCGG 2805 

I III II llllllll lllli I lllll lllllllllll lillllM lllll 

2674 GTCCCAGATTGCTGCCTCCAGTCCACTTGTCAAAACAGCCTGCTGTGCCGGGGTTCCCGC 2733 

2806 GACCCACTGGACATCATTCAGCAGGGCCAGACGGATTGGCCCGCAGTGAAGTCCTTCTAT 2865 

II II II llllllll II III MM I I II II II llllllll llllll 
2734 GATCCTCTTGACATCATACAACAGAGCCATTCTGGTTCACCAGCTGTGAAGTCATTCTAT 2793 



71 
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10 



15 



20 



QUERY: 
SBJCT: 
QUERY: 
SBJCT: 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 
QUERY: 
SBJCT : 



2866 GACCGTATCAAGCTCTTGGCAGGCAAGGATAGCACCCACATCATTQ 

II II iltJMllll I II Mill Mill II llllll 
2794 GATCGAATdMBTCTTAGTCGGGAAGGACAGCACTCATATCATTl 



'CgGG 



^SGAGAGAACCCT 2 92 5 

II II 
lAGAAAATCCC 2853 



2926 TTCAACAGCAGCTTGGTTTCTCTCATCCGAGGCCAAGTAGTAACTACAGATGGAACTCCC 2985 

IIIIIIIIIMI I II lllil M llllllllll II lllillllllllll II 

2854 TTCAACAGCAGCCTTGTGTCTCTTATAAGAGGCCAAGTGGTGACTACAGATGGAACGCCT 2913 

2986 CTGGTCGGTGTGAACGTGTCTTTTGTCAAGTACCCAAAATACGGCTACACCATCACCCGC 304 5 

II II II II llllllll IMIIIIIIII Mill II Mill llllllll II 
2914 CTAGTTGGGGTCAACGTGTCATTTGTCAAGTATCCAAAGTATGGCTATACCATCACTCGT 2973 

3046 CAGGATGGCACGTTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAG 3105 

llllllllll III III II I M II II II I III I III I llllllll 
2974 CAGGATGGCATGTTTGACTTGGTTGCTAACGGTGGATCATCCCTAACTTTGCACTTTGAA 3 033 

3106 CGAGCCCCGTTCATGAGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCC 3165 

II Mill II Mill Mill I II II llllllllllllllllllll II III 

3034 CGGGCCCCATTTATGAGTCAGGAAAGGACAGTATGGCTGCCGTGGAACAGCTTCTATGCC 3093 



25 



vH30 



= ? 



M 35 



QUERY: 3166 ATGGACACCCTGGTGATGAAGACCGAGGAGAACTCCATCCCCAGCTGTGACCTCAGTGGC 322 5 

llllllll II II Mill II llllllllllllll lllllllllll lllllllll 
SBJCT: 3094 ATGGACACGCTTGTAATGAAAACAGAGGAGAACTCCATTCCCAGCTGTGATCTCAGTGGC 3153 

QUERY: 3226 TTTGTCCGGCCTGATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTGCTGCC 3285 

llllll I lllllllll IIMIM II II llllllll II Mill MM III 

SBJCT: 3154 TTTGTCAGACCTGATCCAGTCATCATTTCATCACCACTGTCAACTTTCTTCAGTGATGCT 3213 
QUERY: 3286 CCTGGGCAGAATCCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCT 334 5 

Mill I Mill II II II II lllllllllllllllllllllll III Mill 

SBJCT: 3214 CCTGGCCGAAATCCTATTGTACCAGAAACCCAGGTTCTTCATGAAGAAATTGAGGTCCCT 3273 

QUERY: 3346 GG 3347 
II 

SBJCT: 3274 GG 3275 



CI 40 



SCORE = 547 BITS (276), EXPECT 
IDENTITIES = 540/628 (85%) 
STRAND = PLUS / PLUS 



E-152 



45 



50 



55 



60 



65 



70 



QUERY: 


782 


SBJCT : 


587 


QUERY: 


842 


SBJCT : 


647 


QUERY: 


902 


SBJCT : 


707 


QUERY: 


962 


SBJCT : 


767 


QUERY: 


1022 


SBJCT : 


827 


QUERY: 


1082 


SBJCT : 


887 


QUERY: 


1142 


SBJCT : 


947 


QUERY: 


1202 



GTCGTCCCATTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGCT 841 

llllllllllllllllllllllllll MM II llllllllllllllllllll II I 
GTCGTCCCATTCCACCTACATCCTCGTCTAGCCTTCTCCCATCTGCTCAGCTGCCCAGTT 646 

CCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCATC 901 

I MIMMMMMMMMMMMMMMMMMMIMMMI II lllllll 

CTCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAATACGTCCCATC 706 

AAATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCAGAGCAT 961 

llllllllllllllll Mill Mill Mill II lllllllllll II lllllll 
AAATCATGGACACCAATCCTGACGAGGAGTTCTCTCCTAATTCATACCTACTAAGAGCAT 766 

GCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACCACAGCCAGTCGACTC 1021 

I llllllll Mill II lllilllllllllll I Mill lllllllllll II I 
GTTCAGGGCCACAGCAGGCATCCAGCAGTGGCCCTTCAAACCATCACAGCCAGTCAACGC 826 

TGAGGCCCCCTCTCCCACCCCCTCACAACCACACGCTGTCCCATCACCACTCGTCCGCCA 1081 

lllllll llllllll II IIIIIIIIMII MMIMMMM llllllll MM 

TGAGGCCACCTCTCCCCCCTCCTCACAACCACTCGCTGTCCCATCATCACTCGTCTGCCA 886 
ACTCCCTCAACAGGAACTCACTGACCAATCGGCGGAGTCAGATCCACGCCCCGGCCCCAG 1141 

IIIIMIIIIIIIIIIIII II Mill II II I IIIIMIIIII II II II I 

ACTCCCTCAACAGGAACTCGCTCACCAACCGCCGCAACCAGATCCACGCGCCTGCTCCCG 946 

CGCCCAATGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAGGACAGCTGGGTGCTAA 1201 

I llllllllllllll Mill II Mill II Mill lllllllllllllllll I 
CTCCCAATGACCTGGCGACCACGCCTGAGTCTGTGCAGCTGCAGGACAGCTGGGTGCTCA 1006 



72 



15966-697 



SBJCT: 1007 



IIMIII 

ACAGCAAf 



IILUI 



IIIIMII MM III I II lllll 

IGCTGGAGACCAGGCATTTCTTGTTTAAGACA' 



1^ I 

.i^Btgi 



I I III 
;GAACGACTC 1066 



QUERY: 1262 CCTTGTTCAGCAGCTCTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCCC 1321 

I IIIIMI IIIIIIIIMI II lllll lllllllllllll lllll II II I 

SBJCT: 1067 CGCTGTTCAGTAGCTCTTCCCCTGGCTACCCACTGACCTCAGGAACAGTTTATACTCCAC 1126 



QUERY: 1322 CGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAGGCTTTCAAGCTGAAGAAGCCCT 1381 

I III I III I II II lllll II MIIMII II IIIMIIMII IIIMM 

SBJCT: 1127 CTCCCAGGCTGTTACCTAGAAATACATTTTCCAGGAATGCATTCAAGCTGAAAAAGCCCT 1186 
QUERY: 1382 CCAAATACTGCAGCTGGAAATGTGCTGC 1409 

MM II II lllllllllllllllll 

SBJCT: 1187 CCAAGTATTGTAGCTGGAAATGTGCTGC 1214 

SCORE = 391 BITS (197), EXPECT = E-105 
IDENTITIES = 593/725 (81%) 
STRAND = PLUS / PLUS 

QUERY: 7156 CATGTCTACAATCACTCCAACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCAC 7215 

llllllllllllll IIIM II II llllllll Mill II II II II llllll 

SBJCT: 7084 CATGTCTACAATCATTCCAATTCAGAAATTACCTCTCTGTATTATGATCTGCAAGGCCAC 7143 



QUERY: 7216 CTCTTTGCCATGGAGAGCAGCAGTGGGGAGGAGTACTATGTTGCCTCTGATAACACAGGG 7275 

llllllll llllllll lllllllllll II II lllll lllll llllllll M 

SBJCT: 7144 CTCTTTGCAATGGAGAGTAGCAGTGGGGAAGAATATTATGTCGCCTCCGATAACACGGGC 7203 

QUERY: 7276 ACTCCTCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCC 7335 

lllll II II II lllllllllll llllllllllllllllllll llllllll II 
SBJCT: 7204 ACTCCGCTAGCCGTATTCAGCATCAATGGCCTCATGATCAAACAGCTTCAGTACACTGCA 7263 

QUERY: 7336 TATGGGGAGATTTATTATGACTCCAACCCCGACTTCCAGATGGTCATTGGCTTCCATGGG 7395 

II II MMMMMMMMI lllll II MUM MM lllll llllllll 

SBJCT: 7264 TACGGAGAGATTTATTATGACTCAAACCCTGATTTCCAGCTGGTTATTGGGTTCCATGGA 7323 

QUERY: 73 96 GGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTG 74 55 

II II lllll II I lllll II lllll II II II I II II lllll II 
SBJCT: 7324 GGGCTGTATGATCCTTTAACCAAACTCGTCCATTTTACCCAAAGGGACTACGATGTCCTT 73 83 

QUERY: 7456 GCAGGACGATGGACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCC 7515 

II lllll lllll II II II II II llllllllllll III! II II II 

SBJCT: 7384 GCTGGACGCTGGACATCTCCTGATTACACAATGTGGAAAAACATTGGTAGAGAACCTGCT 7443 

QUERY: 7516 CCCTTTAACCTGTATATGTTCAAGAGCAACAATCCTCTCAGCAGTGAGCTAGATTTGAAG 7575 

lllll II lllll lllllllllll lllll llllllllll III II III I III 
SBJCT: 7444 CCCTTCAATCTGTACATGTTCAAGAGTAACAACCCTCTCAGCAATGAACTGGATCTAAAG 7503 

QUERY: 7576 AACTACGTGACAGATGTGAAAAGCTGGCTTGTGATGTTTGGATTTCAGCTTAGCAACATC 7635 

II II II llllllll lllllllllll IIIMIII llllllllllllllllllll 

SBJCT: 7504 AATTATGTAACAGATGTCAAAAGCTGGCTGGTGATGTTCGGATTTCAGCTTAGCAACATT 7563 
QUERY: 7636 ATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCCTATGAATTGTCAGAG 7695 

llllllllllllll Mill liiiiiii II III I lllll II II II I III 

SBJCT: 7564 ATTCCTGGCTTCCCTAGAGCAAAAATGTACTTTGTGTCACCTCCATACGAGCTGACTGAG 7623 
QUERY: 7696 AGTCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAACAGACAACAGAGAGACAT 7755 

llllllll MM lllllllllll llllllll lllll IIIIIIMIII lllll 

SBJCT: 7624 AGTCAAGCGTGTGAAAATGGACAGCTAATTACAGGAGTCCAGCAGACAACAGAAAGACAC 7683 

QUERY: 7756 AACCAGGCCTTCATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATC 7815 

II II II llillllllll II lllllllllll llllll I II lllll II 
SBJCT: 7684 AATCAAGCTTTCATGGCTCTTGAGGGACAGGTCATATCTAAAAGATTACATGCCAGTATT 774 3 

QUERY: 7816 CGAGAGAAAGCAGGTCACTGGTTTGCCACCACCACGCCCATCATTGGCAAAGGCATCATG 7875 

MM llllllll lilllllllll II I III II II lllll lllll llllll 

SBJCT: 7744 AGAGAAAAAGCAGGCCACTGGTTTGCAACAAGCACTCCTATTATTGGGAAAGGAATCATG 7803 

QUERY: 7876 TTTGC 7880 
lllll 

SBJCT: 7804 TTTGC 7808 
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SCORE = 33 9 BITS (1 
IDENTITIES = 429/51 
STRAND = PLUS / PLUS 



• 



EXPECT = 2E-89 
%) 



QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 



7967 ACTACCTGGACAAGATGCACTACAGCATCGAGGGCAAGGACACCCACTACTTTGTGAAGA 8026 

IIIMIIIII II llllllllllllllllllll lllll II lllllllllll MM 
7895 ACTACCTGGAAAAAATGCACTACAGCATCGAGGGGAAGGATACTCACTACTTTGTCAAGA 7954 

8027 TTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGGTGCTAG 8086 

I Mllllllllll lllllll lllll M I I MM III I lllll II I 
7955 TAGGCTCAGCCGATAGCGACCTCGTCACCCTCGCGATGACCAGCGGGAGGAAGGTCCTGG 8014 

8087 AGAGCGGGGTGAACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAA 8146 

I lllll II Mlllllllll Mllllll II II II IIIIIM IIIIIMII 

8015 ACAGCGGAGTAAACGTGACCGTCTCCCAGCCAACCCTCCTTATCAACGGAAGGACTCGAC 8074 
8147 GGTTCACGAACATTGAGTTCCAGTACTCCACGCTGCTGCTCAGCATCCGCTATGGCCTCA 8206 

lllllll lllll lllll lllll Mill IIIMI III IIMMMI II MM 

8075 GGTTCACAAACATCGAGTTTCAGTATTCCACCCTGCTGATCAACATCCGCTACGGGCTCA 8134 
8207 CCCCCGACACCCTGGACGAAGAGAAGGCCCGCGTCCTGGACCAGGCGAGACAGAGGGCCC 8266 

II lllllll Mill II IIIIIIM II II II llllllll I III I MM 

813 5 CCGCCGACACGCTGGATGAGGAGAAGGCACGAGTGCTAGACCAGGCTCGGCAGCGAGCCC 8194 
8267 TGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGGAGCCGCC 8326 

MM MIIIIIMIIII IIIIIIMIII II MM II I lllll MUM 

8195 TGGGGTCGGCCTGGGCCAAAGAGCAGCAGAAGGCACGGGATGGCCGCGAGGGCAGCCGCG 8254 

8327 TGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTACGAGG 8386 

I lllll II II llllllll llllllllll III II I II lllll lllllll 
8255 TATGGACAGACGGAGAGAAGCAACAGCTTCTGAACACGGGAAGGGTTCAAGGTTACGAGG 8314 

83 87 GATATTACGTGCTTCCCGTGGAGCAATACCCAGAGCTTGCAGACAGTAGCAGCAACATCC 844 6 

lllllll II I II llllllll lllllllllll llllllllllllllllllllll 
8315 GATATTATGTCTTGCCTGTGGAGCAGTACCCAGAGCTAGCAGACAGTAGCAGCAACATCC 8374 

8447 AGTTTTTAAGACAGAATGAGATGGGAAAGAGGTAA 84 81 

MIIIIIIIIMIIIIIII lllllllllllllll 
83 75 AGTTTTTAAGACAGAATGAAATGGGAAAGAGGTAA 84 09 



SCORE = 323 BITS (163), EXPECT = lE-84 
IDENTITIES = 397/475 (83%) 
STRAND = PLUS / PLUS 

QUERY: 299 GACACCGCTCTTTGACCAGAGGACGCTGTGGCAAAGAGTGTCGCTACACAAGCTCCTCTC 

IIIIIIIIMIIIIII lllll II II II II lllllllllll II II II II I 
SBJCT : 20 GACACCGCTCTTTGACGAGAGGCCGGTGCGGGAAGGAGTGTCGCTATACTAGTTCTTCAC 



358 



79 



QUERY: 359 TGGACAGTGAGGACTGCCGGGTGCCCACACAGAAATCCTACAGCTCCAGTGAGACTCTGA 418 

I IMIMII llllll lllll I lllll llllllllllllllllllll MM 
SBJCT: 80 TCGACAGTGAAGACTGCAGAGTACCAGCTCAGAAGTCCTACAGCTCCAGTGAGACCCTGA 139 

QUERY: 419 AGGCCTATGACCATGACAGCAGGATGCACTATGGAAACCGAGTCACAGACCTCATCCACC 4 78 

I II MM llllllll lllllllllll Mill lllll lllllll I III 

SBJCT: 140 AAGCATATGGCCATGACACGAGGATGCACTACGGAAATCGAGTTTCAGACCTGGTTCACA 199 

QUERY: 4 79 GGGAGTCAGATGAGTTTCCTAGACAAGGAACCAACTTCACCCTTGCCGAACTGGGCATCT 538 

lllllll lllllllllll II llllllll IIIIIMIIIIIII IIIIIIM MM 
SBJCT: 200 GGGAGTCGGATGAGTTTCCAAGGCAAGGAACGAACTTCACCCTTGCAGAACTGGGAATCT 259 

QUERY: 539 GTGAGCCCTCCCCACACCGAAGCGGCTACTGCTCCGACATGGGGATCCTTCACCAGGGCT 598 

llllllllll II II lllll IIIIIMIIII Mill II II II II II MM 
SBJCT: 260 GTGAGCCCTCTCCCCATCGAAGTGGCTACTGCTCGGACATAGGAATACTCCATCAAGGCT 319 



QUERY: 599 ACTCCCTTAGCACAGGGTCTGACGCCGACTCCGACACCGAGGGAGGGATGTCTCCAGAAC 658 

I III I lllll II lllll II Mill lllll lllll llllllllllllll I 
SBJCT: 320 ATTCCTTGAGCACTGGCTCTGATGCTGACTCAGACACGGAGGGCGGGATGTCTCCAGAGC 379 



QUERY: 659 ACGCCATCAGACTGTGGGGCAGAGGGATAAAATCCAGGCGCAGTTCCGGCCTGTCCAGTC 718 

74 15966-697 



MM MMIIIMMM MMMM MMMM M MMI^MMI MM 

SBJCT: 380 ACGCGATCAG^^ETGGGGAAGAGGGATCAAATCCAGCCGAAGTTC^^CTGTCAAGTC 439 



20 



r: 40 



55 



QUERY: 719 GTGAAAACTCX5G'CCCTTACCCTGACTGACTCTGACAACGAAAAC7\AATCAGATGA 773 

5 MMMMMMI M M M MMMM MMI M MIM MMMM 

SBJCT: 440 GTGAAAACTCGGCTCTCACGCTCACTGACTCCGACAATGAGAACAAGTCAGATGA 4 94 

The full FCTR3a amino acid sequence also has 342 of 383 amino acid residues (89%) 
identical to, and 342 of 383 residues (89%) positive with, the 276 amino acid residue Odd 
10 Oz/ten-m homolog 2 (Drosophila) (GenBank Acc: NP_035986.2) (SEQ ID NO:68) (Table 3P). 

Table 3P. BLASTP of FCTR3a against Odd Oz/ten-m homolog 2 - (SEQ ID NO:68) 

> GI|7657415|RSF|NP 035986.21 ODD OZ/TEN-M HOMOLOG 2 (DROSOPHILA) ; ODD OZ/TEN-M HOMOLOG 
3 

(DROSOPHILA) [MUS MUSCULUS] 
15 GII4760778 |DBJ|BAA77397.1| (AB025411) TEN-M2 [MUS MUSCULUS] 

LENGTH = 2764 



SCORE = 495 BITS (1274), EXPECT = E-139 

IDENTITIES = 342/383 (89%), POSITIVES = 342/383 (89%), GAPS = 41/383 (10%) 
QUERY: 3 7 HNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQSTL 96 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

C=l SBJCT: 189 HNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQSTL 24 8 

ill 25 QUERY: 97 RPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESVQLQDSWVLN 156 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

SBJCT: 24 9 RPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESVQLQDSWVLN 308 
QUERY: 157 SNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPS 216 

:r| 30 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

SBJCT: 309 SNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPS 368 
£ QUERY: 217 KYCSWKCAALSAIAAALLLAILLAYFI 243 

□ MMMIMMMMMMMMMM 

i.i 35 SBJCT: 369 KYCSWKCAALSAIAAALLLAILLAYFIAMHLLGLNWQLQPADGHTFNNGVRTGLPGNDDV 428 
QUERY: 244 VPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLKFNISLGKD 295 

W MMMMMMMMMMMMMMMMMMMMMMMMMM 

rl SBJCT: 429 ATVPSGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLKFNISLGKD 488 



QUERY: 296 ALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVFVQYLDVGL 355 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

SBJCT: 489 ALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVFVQYLDVGL 548 



45 QUERY: 356 WHLAFYNDGKDKEMVSFNTWLD 378 

MMMMMMMMMMMI 

SBJCT: 54 9 WHLAFYNDGKDKEMVSFNTWLD 571 

The full FCTR3b amino acid sequence has 2442 of 2802 amino acid residues (87%) 
50 identical to, and 2532 of 2802 residues (90%) positive with, the 2802 amino acid residue 
teneurin-2 [Gallus gallus] (GenBank Acc: AJ279031) (SEQ ID NO:69) (Table 3Q). 

Table 3Q. BLASTP of FCTR3a against Teneurin-2 - (SEQ ID NO:69 



> GI 1 1024 1574 I EMB I CAC09416 . 1 1 (AJ279031) TENEURIN-2 [GALLUS GALLUS] 
LENGTH = 2802 

SCORE = 4853 BITS (12589), EXPECT =0.0 

IDENTITIES = 2510/2802 (87%), POSITIVES = 2600/2802 (90%), GAPS = 69/2802 (2%) 
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361 


QUERY : 
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472 
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541 
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532 
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601 
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592 
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661 


QUERY : 


652 


SBJCT : 


721 


QUERY: 


712 


SBJCT: 


781 


QUERY: 


772 


SBJCT : 


841 


QUERY : 


832 


SBJCT: 


901 


QUERY: 


892 


SBJCT : 


961 


QUERY: 


952 



MDVKDRRHRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSET 

• llllllllllllllllllllll lllllllll 

SDEFVPF 



iHDSRMHYGNR 

Ihlllllll 
GHDTRMHYGNR 



VTDLIHRESDEFVPRQGTNFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 

kl^llMIIIIIIIIIMIMIillMIIIMIIMhMIIIIIIIIIIIMIIIIi 

VSDLVHRESDEFPRQGTNFTLAELGICEPSPHRSGYCSDIGILHQGYSLSTGSDADSDTE 
GGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDENG 

llllllllllllllllll IIIMIIIIIIIIIMIIMIIIil II 

GGMSPEHAIRLWGRGIKSSRSSGLSSRENSALTLTDSDNENKSDEENDFHTHLSEKLKDR 

RPIPPTSSPSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDT 

llllllll IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 
QTSWQQLAETKNSLIRRPIPPTSSSSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDT 



60 



60 



120 



120 



168 



180 



212 



240 



NPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQSTLRPPLPPPHNHTLSHHHSSANSLNR 272 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIII 

NPDEEFSPNSYLLRACSGPQQASSSGPSNHHSQSTLRPPLPPPHNHSLSHHHSSANSLNR 300 

XXXXXXXXQIHAPAPAPNDLATTPESVQLQDSWVIoNSNVPLETRHFLFKXXXXXXXXXXX 332 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 
NSLTNRRNQIHAPAPAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKTSSGTTPLFSS 360 

XXXXYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWKCXXXXXXXXXXXXXXXXX 3 92 

lllllllllllllllllllllll IIIIIIIIMIIIII 
SSPGYPLTSGTVYTPPPRLLPRNTFSRNAFKLKKPSKYCSWKCAALSAIAAAVLLAILLA 420 

YFIV PWSLKNSSIDSGEAE 411 

III II ^llllllll I 

YFIAMHLLGLNWQLQPADGHTFSNGLRPGAAGAEDGAAAPPAGRGPWVTRNSS IDSGETE 4 80 

VGRRVTQEVPPGVFWRSQIHISQPQFLKFNISLGKDALFGVYIRRGLPPSHAQYDFMERL 471 

llhlMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

VGRKVTQEVPPGVFWRSQIHISQPQFLKFNISLGKDALFGVYIRRGLPPSHAQYDFMERL 54 0 
IX3KEKWSWESPRERRSIQTLVQNEAVFVQYLDVGLWHIAFYNDGKDKEMVSFNTVVLDS 531 

II III II I II II Mill I II llllllll II I INI MM II II III II kllkIM II 

DGKEKWSWESPRERRSIQTLVQNEAVFVQYLDVGLWHLAFYNDGKDKEWSFSTVILDS 600 

VQDCPRNCHGNGECVSGVCHCFPGFLGADCAKAACPVLCSGNGQYSKGTCQCYSGWKGAE 591 

lllllllllllllllllllllllll llllllllllllllllllllllll IIMIII I 
VQDCPRNCHGNGECVSGVCHCFPGFHGADCAKAACPVLCSGNGQYSKGTCLCYSGWKGPE 660 

CDVPMNQC I D P S CGGHGS C I DGNCVCSAG YKGEHCE E VDCLDPTCS SHGVCVNGECLCS P 651 

lllkHIIIIIIIIIIIII^IIIIII mmi-himiiiimmi^mmimiimm 

CDVPISQCIDPSCGGHGSCIEGNCVCSIGYKGENCEEVDCLDPTCSNHGVCVNGECLCSP 72 0 

GWGGLNCELARVQCPDQCSGHGTYLPDTGLCSCDPNWMGPDCSVEVCSVDCGTHGVCIGG 711 

lllkllll I lllllllllllll llllllllllllllllllllllllllllllllll 
GWGGINCELPRAQCPDQCSGHGTYLSDTGLCSCDPNWMGPDCSVEVCSVDCGTHGVCIGG 780 

ACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHCTIGRQTAGTETDGCPD 771 

llllllllll llllllillll llllllllllililMIIIIIIIIIII lllllllll 
ACRCEEGWTGVACDQRVCHPRCTEHGTCKDGKCECREGWNGEHCTIGRQTTGTETDGCPD 840 

LCNGNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCADNKDNEGDGLVDCLDPDCCLQSA 831 

IIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII IIMIII 
LCNGNGRCTLGQNS WQCVCQTGWRGPGCNVAMETSCADNKDNEGDGLVDCLVPDCCLQST 900 

CQNSLLCRGSRDPLDIIQQGQTDWPAVKSFYDRIKLLAGKDSTHIIPGENPFNSSLVSLI 891 

MIIIIIIIIIIMIIMI ^ MIIIIIMMM I II II I M I II 1 1 1 1 1 1 1 1 1 II 

CQNSLLCRGSRDPLDIIQQSHSGSPAVKSFYDRIKLLVGKDSTHIIPGENPFNSSLVSLI 960 
RGQWTTDGTPLVGVNVSFVKYPKYGYTITRQDGTFDLIANGGASLTLHFERAPFMSQER 951 

MMMMMMMMIMMMMIMMIIII M hi M hM II M 1 1 1 1 II II 1 1 

RGQWTTDGTPLVGVNVSFVKYPKYGYTITRQDGMFDLVANGGSSLTLHFERAPFMSQER 1020 
TVWLPWNSFYAMDTLVMKTEENSIPSCDLSGFVRPDPIIISSPLSTFFSAAPGQNPIVPE 1011 

lilMMMMMMMMMMIMMIIMMMI^MMIMMM MMIIIII 
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QUERY : 


1792 
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/^T Tn T\ XT 

QUERY : 


1852 


SB JCT : 


1921 


QUERY : 


1912 


SB JCT : 


1981 


QUERY: 


1972 


SB JCT : 


2041 



TVWLPWNSFYAMDTLVMKTEENSIPSCDLSGFVRPDPVIISSPLS 



TQVIJIEEI^BPnVKLRYLSSRTAGYKSLLKITMTQSTVPLNLIT^EMVAVEGHLFQK 1071 

MIIIIMlTri^'^ll IIIIIIIIIIIMM INI IIIIIIHIIIIIIIMIIII 

TQVl^EEIEVPGSSIKLIYLSSRTAGYKSLLKIIMTQSLVPLNLIKVHLMVAVEGHLFQK 1140 
SFQASPNIiASTFIWDKTDAYGQRVYGLSDAWSVGFEYETCPSLILWEKRTALIiQGFELD 1131 

M IIIMI IIIIIIIIIMkMIIIIIMMIIMIIIIIIIIIIIillllllllll 

SFLASPNLAYTFIWDKTDAYGQKVYGLSDAWSVGFEYETCPSLILWEKRTALLQGFELD 1200 
PSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAIITSIMGNGRRRSISCPSCNGLA 1191 

MllllllllllklMIIIIIIII lllllillllhlllllllMIIMIIIIIIIM 

PSNLGGWSLDKHHVLNVKSGILHKGNGENQFLTQQPAVITSIMGNGRRRSISCPSCNGLA 1260 
EGNKLIAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYYLA 1251 

MIIIIMIIIIIIIIIII + illlilllllllllilllMllllilllllllllllllll 

EGNKLLAPVALAVGIDGSLFVGDFNYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYYLA 1320 
VDPVSGSLYVSDTNSRRIYRVKSLSGTKDLAGNSEWAGTGEQCLPFDEARCGDGGKAID 1311 

IIIIIIIIIIIIIIIMIklllhlllllllMMIIMIIIMilllllllllllkl 

VDPVSGSLYVSDTNSRRIYKVKSLTGTKDLAGNSEWAGTGEQCLPFDEARCGDGGKAVD 13 80 
ATI^SPRGIAVDra^GLMYFVDATMIRKVDQNGIISTLLGSNDLTAVRPLSCDSSMDVAQV 13 71 

lliilllilllll illllllllllllllllMMIillllllMMIIIIIilllhll 

ATLMSPRGIAVDKYGLMYFVDATMXRKVDQNGIISTLLGSNDLTAVRPLSCDSSMDVSQV 1440 
RLEWPTDLAVNPMDNSLYVLENNVXLRITENHQVSIIAGRPMHCQVPGIDYSLSKXXXXX 14 31 

llllillllklilllllllllllllllllllMIIIIIIIIMIIIIIIIIIII 

RLEWPTDLAVDPMDNSliYVLENNVILRITENHQVSIIAGRPMHCQVPGIDYSLSKLAIHS 1500 
XXXXXXXXXXXXTGVLYITETDEKKINRLRQVTTNGEICLLAGAASXXXXXXXXXXXXYS 14 91 

MIIIIHIIIIIIIIIIIilllllllllllMI II 

ALESASAIAISHTGVLYISETDEKKINRLRQVTTNGEICLLAGAASDCDCKNDVNCNCYS 1560 
GDDAYATDAILNSPSSIiAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQE 1551 

III IMIIIIIIIIIIIIIIIIIIIIIMIMIIIMIIkklhlllllllllllll 

GDDGYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNRPILNSFNQYEAASPGEQE 1620 
L YVFNADG I HQ YTVS LVTGE YLYNFTYS TDNDVTEL I DNNGNS LK I RRDSSGM PRHLLMP 1611 

IlilllllllllklllllllllllllkiilllhHHMIIkllkllllllllll 

LYVFNADGIHQYTLSLVTGEYLYNFTYSSDNDVTEVMDSNGNSLKVRRDASGMPRHLLMP 1680 
DNQIITLTVGTNGGLKWSTQNLELGLMTYDGNTGLLATKSDETGWTTFYDYDHEGRLTN 1671 

lllkll lliilllilllll IIIIIIIMhIillllllllllllllllllllllll 

DNQIVTLAVGTNGGLKLVSTQTLELGLMTYNGNSGLLATKSDETGWTTFYDYDHEGRLTN 1740 
VTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQLCN 1731 

lllllllllilllllllllllllilllllllMlllllllllllllllllllllllllll 

VTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQLCN 1800 
NGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKV 1791 

llllllllllll llllllllllllhlllllllllMIIMIIIIIIIIIIIIIIIIII 

NGTLRVM YANGMS I S FHS E PHVLAGTVTPT IGRCN I S LPMENGLNS I EWRLRKEQ I KGKV 1860 
TIFGRKLRVHGRNLLSIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVN 1851 

hllllllllllllllllllllllllllllllllllllllllkllllllllllMIIII 

TVFGRKLRVHGRNLLS I D YDRN I RTEK I YDDHRKFTLRI I YDQLGRP FLWLPS SGLAAVN 1920 
VSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWSYSYLDKSMVLLLQSQRQYI 1911 

IIIIIMMIIIMIIMIIIIIIIIIIklllllllllll klhllllllllllllll 

VSYFFNGRLAGLQRGAMSERTDIDKQGRIISRMFADGKVWSYTYLEKSMVLLLQSQRQYI 1980 

FE YDSSDRLLAVTMPSVARHSMSTHTS IGYIRNI YNPPESNASVI FD YSDDGR I LKTS FL 1971 

Illllllll llllllllllllllllkllllllllllllllllllllllllllllllll 
FEYDSSDRLHAVTMPSVARHSMSTHTSVGYIRNIYNPPESNASVIFDYSDDGRILKTSFL 2040 

GTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLV 2031 

llllllllllilllllllllllllllllllllllllllllllllillllllillllllll 

GTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLV 2100 



;T|gS] 



DAPGRNPIVPE 1080 
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DKQIYRFSEEi 

lllllll 
DKQIYRFS 



EEGMVN. 

A' 



;MVNARFD YT YHDNS FR I AS I KP VI SETPL P VDL' 

II I Ml II I I II II 11 I I hi II I I II Mill 
TARED YTYHDNS FR I AS I KP 1 1 S ETPL P VDLY^ 




ISGKVEHFGKF 2091 

Illllllllll 
ISGKVEHFGKF 2160 



GVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMYWMTVQYDSMGRVIKRELKL 2151 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiiiiii mill 

GVI YYDINQI ITTAVMTLSKHFDTHGRI KEVQYEMFRSLMYWMTVQYDSMGRVTKRELKL 2220 
GPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYS YDXXXXXXXXXXXXS VRLMPLRYDLRD 2211 

ilMMMMIMMMMIIMIIMMMMII IIIIIIIIIIMI 

GPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLHLLNPGNS VRLMPLRYDLRD 2280 
RITRLGDVQYKIDDDGYLCQRGSDI FEYNSKGLLTRAYNKASGWSVQYRYDGVGRRASYK 2271 

lllllll IIIIIIMIIIIIIHIIIIIIIIIIIIIIMMIIIIIklllll I 

RITRLGDIPYKIDDDGFLCQRGSDVFEYNSKGLLTRAYNKANGWNVQYRYDGLGRRASCK 2340 

TNLGHHLQYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGHLFAMESSSGEEYYVASDNT 23 31 

llllllllllhllllllhllllllllllllllllllllllllllllllllllllllll 
TNLGHHLQYFYADLHNPTRVTHVYNHSNSEITSLYYDLQGHLFAMESSSGEEYYVASDNT 24 00 

GTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDV 2391 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIhlllllllllllllllllllllllll 

GTPLAVFS INGLM I KQLQ YTAYGE I YYDSNPDFQLV I GFHGGLYDPLTKLVHFTQRDYDV 2460 
LAGRWTSPDYTiVIWKNVGKEPAPFNLYMFKSNNPLSSELDLKNYVTDVKSWLVMFGFQLSN 2451 

III II lllllll! I MkllllllllllllllMhIIIII II llllllll II lllllll 

LAGRWTSPDYTMWKNIGREPAPFNLYMFKSNNPLSNELDLKNYVTDVKSWLVMFGFQLSN 2520 

IIPGFPRAKMYFVPPPYELSESQASENGQLITGVQQTTERHNQAFMALEGQVITKKLHAS 2511 

lllllllilllll llllkllll llllllllllllllllllllllllllll>MIII 
1 1 PGFPRAKMYFVS P P YELTESQACENGQL I TGVQQTTERHNQAFMALEGQVI S KRLHAS 2580 

IREKAGHWFATTTPIIGKGIMFAIKEGRVTTGVSSIASEDSRKVASVLNNAYYLDKMHYS 2571 

lllllllllll + IIIIIIIIIIM + llltlklllk+IIIIHIIIhl^llHIIII 

IREKAGHWFATSTPI IGKGIMFAVKKGRVTTGISSIATDDSRKIASVLNSAHYLEKMHYS 2640 
lEGKDTHYFVKIGSADGDLVTLGTTIGRKVLESGVNVTVSQPTLLVNGRTRRFTNIEFQY 2631 

llllllllllllllll Mill I llllkllllllllllllkllllllllllllll 

lEGKDTHYFVKIGSADSDLVTLAMTSGRKVLDSGVNVTVSQPTLLINGRTRRFTNIEFQY 2700 
STLLLS IRYGLTPDTLDEEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQ 2691 

IIIM+IIIIII lllllllllllllllllllhMIIIIIIIIIIIIIIMI + lllll 

STLLINIRYGLTADTLDEEKARVLDQARQRALGSAWAKEQQKARDGREGSRVWTDGEKQQ 2760 

LLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 2733 

IMIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIMIIII 
LLNTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 2802 

The FCTR3bcde and f amino acid sequences have 1524 of 2352 amino acid residues 
(64%) identical to, and 1881 of 2532 residues (79%) positive with, the amino acid residues 429- 
2771, 93 of 157 residues (59%) identical to and 1 18 of 157 residues (74%) positive v^th amino 
acid residues 1-155, and 59 of 152 residues (38%) identical to and 68 of 152 residues (43%) 
positive with amino acid residues 21 1-361 of Ten-m4 [Mus musculus] (ptnr: GenBank Acc: 
BAA77399.1) (SEQ ID NO:70) (Table 3R). 

Table 3R. BLASTP of FCTR3b, c, d, e, and f against Mw5 musculus Ten-m4 - (SEQ ID 

NO:70) 



> GI I 4760782 I DBJ|BAA773 99 . 1 I (AB025413) TEN-M4 [MUS MUSCULUS] 
LENGTH = 2771 



SCORE = 3089 BITS (8008), EXPECT = 0.0 
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IDENTITIES = 1524/2352 (64%), POSITIVES = 1881/2352 (79%)^^APS = 28/2352 (1%) 
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knssidsg: 
++I Mill 



RRVTQEVPPGVFWRSQIHISQPQFLKFNISLGT^CFGVYIRRGLPP 

+ IIII ^k-^lll llllh I I lllhllll II M hllll 

EDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNVSLGKAALVGIYGRKGLPP 



460 



SHAQYDFMERLDGK EKWSWESPRERRSIQTLVQNEAVFVQYLDVGLWHLAFYND 

II hlhl llh I h k I H IHIII l + llllllll 
SHTQFDFVELLDGRRLLTQEARSLEGPQRQSRGPVPPSSHETGFIQYLDSGIWHLAFYND 

GKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGVCHCFPGFLGADCAKAACPVLCSGNGQ 

Ik hill I HI IkllMHI MM MM II HHIIMIIIII 

GKESEWSFLTTAIESVDNCPSNCYGNGDCISGTCHCFLGFLGPDCGRASCPVLCSGNGQ 
YSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCSAGYKGEHCEEVDCLDPT 

I II I kMlilMMII Mill H Ikll I IH^ Mill lllllhIM 

YMKGRCLCHSGWKGAECDVPTNQCIDVACSSHGTCIMGTCICNPGYKGESCEEVDCMDPT 
CSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDTGLCSCDPNWMGPDCSV 

III MM Ml II MM III I I IMIIIIhllllllhllhl I Ilk 

CSSRGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLPDTGLCNCDPSWTGHDCSI 
EVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHC 

IH+ III nihil nihil iiiiiii iiiii iiiihiiiiii iiiiiii 

EICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHGTCRDGKCECSPGWNGEHC 
********* 

TIGRQTAGTETDGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCADNKDNEG 

II HM llllllllll I IIIMMIIIh HII II llhl 

TIAHYLDRWKEGCPGLCNGNGRCTLDLNGWHCVCQLGWRGTGCDTSMETGCGDGKDNDG 
DGLVDCLDPDCCLQSACQNSLLCRGSRDPLDI IQQGQT- -DWPAVKSFYDRIKLLAGKDS 

lllllhllMMI I + II M IIIMIh I ^ IIIII! I hi! 

DGLVDCMDPDCCLQPLCHVNPLCLGSPDPLDIIQETQAPVSQQNLNPFYDRIKFLVGRDS 
THIIPGENPFNSSLVSLIRGQWTTDGTPLVGVNVSFVKYPKYGYTITRQDGTFDLIANG 

II Mllllh -^llllhhlllllMlhlh I Hllhlllhllh II 

THSIPGENPFDGGHACVIRGQVMTSDGTPLVGVNISFINNPLFGYTISRQDGSFDLVTNG 
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848 



873 



908 



933 
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GASLTLHFERAPFMSQERTVWLPWNSFYAMDTLVMKTEENS I PSCDLSGFVRPDP HISS 993 

I h I lllllhHI hlllh h hhlh Ml IIMIII I lhh+ I 

GISIILRFERAPFITQEHTLWLPWDRFFVMETIVMRHEENEIPSCDLSNFARPNPWSPS 102 8 
PLSTFFSAAPGQNPIVPETQVLHEEIELPGSNVKLRYLSSRTAGYKSLLKITMTQSTVPL 1053 

Ih+I h + IIIII IIIIIII +H llllll MlhhhH hi 

PLTSFASSCAEKGPIVPEIQALQEEIVIAGCKMRLSYLSSRTPGYKSVLRISLTHPTIPF 1088 
NLIRVHLMVAVEGHLFQKSFQASPNLASTFIWDKTDAYGQRVYGLSDAWSVGFEYETCP 1113 

IhHIIIIIIII Ihl I hhh IIIIIII I hhl hi lllhllhll 

NLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGFSEAFVSVGYEYESCP 1148 
SL I LWEKRTALLQGFELDPSNLGGWSLDKHHI LNVKSG I LHKGTGENQFLTQQPAI I TS I 1173 

llllllllhllhhl I llllllllll IhHilllll IlilhHIl H II 

DLILWEKRTAVLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVSQQPPVIGSI 1208 
MGNGRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILEL 1233 

lllllllllllllllllhllllllilll I llllllllllllllllll llhllh 

MGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPSGNVTNILEM 1268 
RNKEFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRIYRVKSLSGTKDLAGNSEWAGTGE 1293 

llhhIhHIIIIIII Ihlh^^HMMIh-^HM ^ III Mllllllh 

RNKDFRHSHSPAHKYYLATDPMSGAVFLSDTNSRRVFKVKSTTWKDLVKNSEWAGTGD 1328 
QCLP FDEARCGDGGKAI DATLMS PRG I AVDKNGLM Y FVDATM I RKVDQNG 1 1 STLLGSND 1353 

lllllh llllllll HI! HIM III Ihllll lllhlllllllllllllll 

QCLP FDDTRCGDGG KATEATLTNPRG I TVDKFGL I Y FVDGTM I RR VDQNG 1 1 STLLGSND 1388 
LTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRITENHQVSIIAGRPM 1413 

Ih IIIIIII h^Hlllllllllhlllllllllhllhhhlllll hlllll 

LTSARPLSCDSVMEISQVRLEWPTDLAINPMDNSLYVLDNNWLQISENHQVRIVAGRPM 1448 
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HCO VPG I D - YS LS KXXXXXXXXXXXXXXXXXTG VL Y I TETDE KK I 

•I Mill Mini 

KVAIHATLESATALAVSHNGVLYIAETDEKKI 



:QVTTNGEICLL 1472 

IIMHIl h 

IQVTTSGEISLV 1508 



AGAASXXXXXXXXXXXXYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKN 1532 

III I ^llll II II IklMIII M ^l^lllllllll + II 

AGAPSGCDCKMDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKN 1568 

KPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYSTDNDVTELIDNNG 1592 

II II I II HI HIMH^ I III II IklllMlk I hi ^ MM 

KP FLiNTQNM YELS S P I DQEL YL FDTSGKHL YTQS LPTGD YL YNFT YTGDGD I TH I TDNNG 1628 
NSLKIRRDSSGMPRHLLMPDNQI ITLTVGTNGGLKWSTQNLELGLMTYDGNTGLLATKS 1652 

I + HIII + lll h^l h HHII k hll II HM IIHIIIIII 

NMVNTORDSTGMPLWLWPIX3QVYWVTMGTNS ALRS VTTQGHELAMMTYHGNSGLIAT^ 1688 
DETGWTTFYDYDHEGRLTNVTRPTGWTS LHREME KS I T I D I ENSNRDDDVTVI TNLS S V 1712 

H lllllhll lllllll III Ml + h H kOllk lllh 

NENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSK-DDVTITTNLSAS 1747 
EASYTWQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPME 1772 

I II^HIIIIMI + HHh^ MM +^ HIklllk IM- h-^IM-^ 

GAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPID 1807 
NGLNS I EWRLRKEQ I KGKVT I FGRKLRVHGRNLLS I DYDRN I RTEK I YDDHRKFTLR 1 1 Y 1832 

MM HII MM HHkllkllll llllklHI IIIIIIIIIMIIIIhl 

NGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRILY 1867 
DQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWS 1892 

II III II III I IIIH I MMIII MM ^ k III Ihllllkll 

DQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRITSRIFADGKMWS 1927 

YSYLDKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESN 1952 

IHIMIIII I IIIIIIIM HII Hllhlll +^ I Ml Mil III I 
YTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGN 1987 

ASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVN 2012 

MM k-HI H I HItlhl IIIIIIIIIH Hhl Ml MM Ml II 

ASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTVN 2047 
LQSGGFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRIASIKPVISETPLP 2072 

IM IhlllllhlllhlHhIIHIIIIIIIIII I llllh Ihlllll 

LQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNY-DNSFRVTSMQAVINETPLP 2106 

VDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMY 2132 

HIIIM^-hIM I lllllllllllllllllllll HIM HIHIIIIIHIIIII 
IDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKEVQYEIFRSLMY 2166 

WMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDXXXXXX 2192 

IIIMIIHIIIHHIIHIIIIIIHHHI IIIIIH^MIH llllll 

WMTVQYDNMGRWKKELKVGP YANTTRYS YE YDADGQLQTVS INDKPLWRYS YDLNGNLH 2226 



XXXXXXSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLLTRAYNKA 

I II IIIIIIIIMIillllllkMIIH III IHIIII III HUH 

LLS PGNSARLTPLR YDLRDR I TRLGDVQ YKMDEDG FLRQRGGDVFE YNS AGLL I KAYNRA 



2252 



2286 



SGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGH 2312 

IIIIIHIIIIHII I h+ IIIIHMII llhHIHIIMIIIIIIIIIIIII 

SGWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLYYDLQGH 2346 
LFAMESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFHG 2372 

Mill IIIHHH II llllllll 111111+ llllllll MklkHIHI 

LFAMELS SGDE FY I ACDN I GTPLAVFSGTGLM I KQ I L YTAYGE I YMDTNPN FQ 1 1 IG YHG 2406 
GLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEP- APFNLYMFKSNNPLSSELDL 2431 

MMIIMMI HIIIIIIIIIIIIM HI + IIHIIIIHIIH+ h 

GLYDPLTKLVHMGRRDYDVLAGRWTS PDHELWKRLSSNS I VPFHLYMFKNNNP I SNSQD I 2466 
KNYVTDVKSWLVMFGFQLSNI I PGFPRAKMYFVPPPYELSESQAS ENGQLITGVQQ 2487 

I +HII llh Mill IHIIH+ + I III H H M IM 
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SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 
QUERY 
SBJCT 



2467 KCFMTDVNSWLLTFGFQLHNVI PGYPKPDTDAME PS YELVHTQMK' 



DNSKSILGVQC 2526 



2488 TTERHNQA^ME GQVITKKLHASIREKAGHWFATTTPll^RlMFAIKEGRVT 2541 

Hl^ II I II I H Ih^ I 111+ IM + IIII * 

2527 EVQKQLKAFVTLERFDQLYGSTITSCQQAPETKK FASSGSIFGKGVKFALKDGRVT 2582 

2542 TGVSSIASEDSRKVASVIJmAYYLDKMHYSIEGKDTHYFVKIGSADGDLVTI/STTIGRK^ 2601 

I + M-ll |^H+HIIMI+ +h+hl llllllll ++III 11 + 11 + 

2583 TDIISVANEDGRRIAAILNNAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGRRT 2642 

2602 LESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQARQR 2661 

ll+llllllll +++IIIII+I+I+ II I 1+ III l+llll 111+ Mil 

2643 LENGVNVTVSQINTMLSGRTRRYTDIQIiQYRALCLNTRYG TTVDEEKVRVLELARQR 2699 

2662 ALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPEL7UDSSSN 2721 

1+ 111+111+ l+l II I II+IIIII+I+IIIIIII+I++I IIIIIII+II++I 
2700 AVRQAWAREQQRLREGEEGLRAWTDGEKQQVIiNTGRVQGYDGFFVTSVEQYPELSDSANN 2759 

2 722 IQFLRQNEMGKR 2733 

I l+ll+lll+l 
2760 IHFMRQSEMGRR 2771 



SCORE = 161 BITS (407), EXPECT = 2E-37 

IDENTITIES = 93/157 (59%), POSITIVES = 118/157 (74%), GAPS = 4/157 (2%) 

QUERY : 1 MDVKDRR-HRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGN 59 

IIII+I+ +IIIIII I IIIUI 111+ + I llllllllllllll 1+1+ 11+ 
SBJCT: 1 MDVKERKPYRSLTRRR-DAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGS 58 

QUERY: 60 RVTDLIHRESDEFPRQGTNFTLAELGICEPS-PHRSGYCSDMGILHQGYSLSTGSDADSD 118 

II I++ +I++II I IIIIIIIII+ I + II + I +1+1+ IIII+ MM + 
SBJCT: 59 RVKDMVPQEAEEFCRTGTNFTLRELGLGEMTPPHGTLYRTDIGLPHCGYSMGASSDADLE 118 

QUERY: 119 TEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLT 155 

+ +IMI +IIIII +1 Ml MM II MM 

SBJCT: 119 ADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLT 155 
SCORE =72.1 BITS (176), EXPECT = 8E-11 

IDENTITIES = 59/152 (38%), POSITIVES = 68/152 (43%), GAPS = 42/152 (27%) 
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QUERY 
SBJCT 
QUERY 
SBJCT 



285 PAPAPND--LATTP ESVQLQDSWVLNSNVPLETR 316 

l+lll I 1+ I I ll + l + llll + MMI 

211 PSPAPTDHSLSGEPPAGSAQEPTHAQDNWLLNSNIPLETRNLGKQPFLGTLQDNLIEMDI 270 

317 HFLFKXXXXXXXXXXXXXXXYPLTSGTVYTPPPRLLPRNTFSRKAFK 363 

Mill Mill lll+llll lll+llll II 

271 LSASRHDGAYSDGHFLFK-PGGTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFN 329 

364 LKKPSKYCSWKCXXXXXXXXXXXXXXXXXYFI 395 

llllllll+IM 11+ 
330 LKKPSKYCNWKCAALSAILISATLVILLAYFV 361 



*FCTR3F DOES NOT CONTAIN THESE AMINO ACIDS 



The 997-2733 amino acid fragment of the FCTR3bcde and f protein was also found to 
have 1695 of 1737 amino acid residues (97%) identical to, and 1695 of 1737 residues (97%) 
positive with the amino a 1737 amino acid residue protein KI/\A1 127 protein [Homo sapiens] 
(GenBank Acc:(AB032953) (SEQ ID N0:71), (Table 3S). 

Table 3S. BLASTP of FCTR3b, c, d, e, and f against Homo sapiens KIAA1127 protein 

(SEQ ID NO:71) 

> GI|6329763 |DBJ|BAA86441 .1 I (AB0329S3) KIAA1127 PROTEIN [HOMO SAPIENS] 
LENGTH = 173 7 



81 



15966-697 



SCORE = 3295 BITS (8545), EXPECT = 0.0 

IDENTITIES = 1695/1 97%) , POSITIVES = 1695/1737 (97 
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TFFSAAPGQNPIVPETQVLHEEIELPGSNVKLRYLSSRTAGYKSLLKITMTQSTVPLNLI 1056 

IIIIIIIIIIIIMIIIMIillllllllllllllllllllllllilllllillllllll 

TFFSAAPGQNPIVPETQVLHEEIELPGSNVKLRYLSSRTAGYKSLLKITMTQSTVPLNLI 60 
RVHLMVAVEGHLFQKSFQASPNLASTFIWDKTDAYGQRVYGLSDAWSVGFEYETCPSLI 1116 

MlillllMlllllllllillll MIIIIIIMIilllllllilllllllllllllll 

RVHLMVAVEGHLFQKSFQASPNLAYTFIWDKTDAYGQRVYGLSDAWSVGFEYETCPSLI 120 
LWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAIITSIMGN 1176 

IIIIMIIiMllllllllllllllllllilMIMIIIMMIIillllllllllllll 

LWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAIITSIMGN 180 

GRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNK 1236 

llllillllllllMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIilMIIIIIIIII 
GRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNK 24 0 

EFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRI YRVKSLSGTKDLAGNSEWAGTGEQCL 12 96 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIM 

EFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRI YRVKSLSGTKDLAGNSEWAGTGEQCL 3 00 
P FDEARCGDGGKAI DATLMS PRG I AVDKNGLMYFVDATM I RKVDQNG 1 1 STLLGSNDLTA 1356 

IIIIIIIIIIIIIIIIIIIIIIIIIMIilMIIIIIIIIIIIIIIIIIIIIMIIIIII 

P FDEARCGDGGKA I DATLMS PRG I AVDKNGLMYFVDATM I RKVDQNG 1 1 STLLGSNDLTA 360 
VRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRITENHQVSIIAGRPMHCQ 1416 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIilllllllllllllMIIIIIIIIIIIIII 

VRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRITENHQVS I lAGRPMHCQ 420 
VPGIDYSLSKXXXXXXXXXXXXXXXXXTGVLYITETDEKKINRLRQVTTNGEICLLAGAA 1476 

llllllllll llllllllllillllllllllllllllllMII 

VPGIDYSLSKLAIHSALESASAIAISHTGVLYITETDEKKINRLRQVTTNGEICLLAGAA 480 

SXXXXXXXXXXXXYSGDDAYATDAI LNS PSSLAVAPDGT I Y I ADLGN I R I RAVS KNKPVL 1536 

I lllllllllllllllllllllllllllllllllllllllllllllll 
SDCDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVS KNKPVL 540 

NAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYSTDNDVTELIDNNGNSLK 1596 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllilll 

NAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYSTDNDVTELIDNNGNSLK 600 
IRRDSSGMPRHLLMPDNQIITLTVGTNGGLKWSTQNLELGLMTYDGNTGLLATKSDETG 1656 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 

IRRDSSGMPRHLLMPDNQIITLTVGTNGGLKWSTQNLELGLMTYDGNTGLLATKSDETG 660 

WTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASY 1716 

lllllllllllllllllllllllllilllllllllllllllllllllllllllllllMI 
WTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNRDDDVTVXTNLSSVEASY 720 

TWQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLN 1776 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIillllllllllllllMIMIIII 

TWQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLN 780 

S I E WRLR KEQ I KGKVT I FGRKLR VHGRNLLS I D YDRN I RTEK I YDDHRKFTLR I I YDQVG 1836 

IIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIilllllMIIIIIIII 
SIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKIYDDHRKFTLRIIYDQVG 84 0 

RPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWSYSYL 1896 

lllllllllllliiilllllllllllllllllllllMllllliillllllMIIMIII 

RPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWSYSYL 900 
DKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVX 1956 

IIIIIIMIIIMIIIMIIIilllliMIIIIIMIIIIIIIIIIIIMIIIIIIMII 

DKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVI 960 
FDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSG 2016 

IMIilllllllllllllllllMIIIMIIIIIMIIIIIIIMIIIIIIIIIIIIIM 

FDYSDDGRI LKTS FLGTGRQVF YKYGKLSKLSE I VYDSTAVTFGYDETTGVLKMVNLQSG 1020 
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GFSCTIRY^^tPLVDKQIYRFSEEGMVNARFDYTYHDNSFRIA^^p^ISETPLPVDLY 2076 

1 1 1 1 1 1 1 1 Wl 1 1 1 i i 1 1 1 N 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 IWI I M 1 1 1 M 1 1 1 

GFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRIASIKPVISETPLPVDLY 1080 



RYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMYWMTV 2136 

IIIIIIIIIIIIIIIIMIIIIilllMIMIIIIIIIIIIIIIIIIIIIIIIIIMIII 
RYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMYWMTV 1140 

QYDSMGRVI KRELKLGP YANTTKYTYDYDGDGQLQS VAVNDRPTWRYS YDXXXXXXXXXX 2196 

IIIIMIIIIIIIIIIIIMIIIIMMIMIilllllllllllllllM 

QYDSMGRVI KRELKLGP YANTTKYTYDYDGDGQLQS VAVNDRPTWRYS YDLNGNLHLLNP 1200 

XXS VRLMPLRYDLRDR I TRLGDVQ YKI DDDG YLCQRGSD I FE YNS KGLLTRAYNKASGWS 2256 

IIMIIIIIIIIIIIIIIIIillllllllllllllllllllllllMIIIIIIIIIII 
GNSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLLTRAYNKASGWS 1260 

VQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGHLFAM 2316 

iMIIIIIIIIIIIIIIIIIIIIillllillllllllllllllllllllllllllllill 

VQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGHLFAM 1320 

ESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFHGGLYD 2376 

llllllllllllllllllllilMlllllllllllllillilllllllllllllllllll 
ESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFHGGLYD 1380 

PLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPFNLYMFKSNNPLSSELDLKNYVT 2436 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIillll 
PLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPFNLYMFKSNNPLSSELDLKNYVT 144 0 

DVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENGQLITGVQQTTERHNQAF 2496 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
DVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENGQLITGVQQTTERHNQAF 1500 

MALEGQVITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGRVTTGVSSIASEDSRKVA 2556 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
MALEGQVI TKKLHAS IREKAGHWFATTTPI IGKGIMFAI KEGRVTTGVSS I ASEDSRKVA 1560 

SVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGRKVLESGVNVTVSQPTLL 2616 

IIIMIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIi 

SVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGRKVLESGVNVTVSQPTLL 162 0 

VNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQARQRALGTAWAKEQQKARD 2676 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
VNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQARQRALGTAWAKEQQKARD 1680 

GREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 2733 

lllllillllllllllllillMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 
GREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 1737 

The amino acid sequences of the FCTR3bcde and f proteins were also found to have 
2528 of 2774 amino acid residues (91%) identical to, and 2557 of 2774 residues (92%) positive 
with, the 2765 amino acid residue protein neurestin alpha [Rattus norvegicus] (GenBank 
Acc:AF086607) (SEQ ID NO:72), shown in Table 3T. 



Table 3T. BLASTP of FCTR3bcd and f against Rattus norvegicus Neurestin alpha (SEQ ID 

NO:72) 

> GI I 9910320 iREFlNP 064473.1 I NEURESTIN ALPHA [RATTUS NORVEGICUS] 
Gil 5712201|GB|AAD47383 .1 IAF086607 1 (AF086607) NEURESTIN ALPHA (RATTUS NORVEGICUS] 
LENGTH ^ 2765 

SCORE = 4988 BITS (12938), EXPECT = 0.0 

IDENTITIES = 2528/2774 (91%), POSITIVES = 2557/2774 (92%), GAPS ^ 50/2774 (1%) 
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Jlllllllllllllllllllllllllllllll 

CRCGKECRYTSSSLDSEDCRVPTQKSYSSSE' 



llllllil 

iDHDSRMHYGNR 



60 



120 



VTDLIHRESDEFPRQGTNFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 

IIII^IIIMII III lllllllllllllllllilMIIIIIIIIIIIIIIIIIIIIII 
VTDLVHRESDEFSRQGANFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 120 

GGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTXXXXXXXXXXXXGRXXXXXXXXXXX 180 
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI M 

GGMS PEHAIRLWGRGI KSRRSSGLSSRENSALTLTDSDNENKSDDDNGRP I PPTSSSSLL 180 

XXXXXXXXHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACXXXXXXXXXXXX 240 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIillll 
PSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPP 240 

NHHSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQIHAPAPAPNDLATTPESVQ 300 

Mill llllllllllllllllllll 
NHHSQSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLiATTPESVQ 300 

LQDSWVLNSNVPLETRHFLFKXXXXXXXXXXXXXXXYPLTSGTVYTPPPRLLPRNTFSRK 360 

IIIIIIIIIIIIIIIIMIII llllllllllllllllllllllll 
LQDSWVLNSNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRK 360 

AFKLKKPSKYCSWKCXXXXXXXXXXXXXXXXXYFI 3 95 

lllllilllllllll III 

AFKLKKPSKYCSWKCAALSAIAAALLLAILLAYFIAMHLIXLNWQLQPADGHTFNNGVRT 420 

VPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLK 43 9 

llllllllllllllllllllllllllllllllllllllllllll 
GLPGNDDVATVPSGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLK 4 80 



FNISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVF 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
FNISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVF 



499 



540 



559 



VQYLDVGLWHLAFYNDGKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGVCHCFPGFLGA 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIII 
VQYLDVGLWHLAFYNDGKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGLCHCFPGFLGA 6 00 

DCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCSA 619 

II II Illllllllil II II iiiiiii II lllllilllllllll II mill iiiiiiM 

DCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCAA 660 

GYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDT 6 79 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllk 
GYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDS 720 

GLCSCDPNWMGPDCSVEVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTC 73 9 

llkllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GLCNCDPNWMGPDCSVEVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTC 780 
********* 

KDGKCECREGWNGEHCTIGRQTAGTETDGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 799 

IIIMIIIIIIIIIIIII IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 
KDGKCECREGWNGEHCTI DGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 831 

NVAMETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDI IQQGQTDWPAVK 859 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 
NVAMETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDI IQQGQTDWPAVK 891 

SFYDRIKLLAGKDSTHIIPGENPFNSSLVSLIRGQWTTDGTPLVGVNVSFVKYPKYGYT 919 

IIIIIIIMIIIIIIIIIIhllllllllllllllllllilllllMIIIIIIIIIIIM 

SFYDRIKLLAGKDSTHI IPGDNPFNSSLVSLIRGQWTTDGTPLVGVNVSFVKYPKYGYT 951 
ITRQDGTFDLIANGGASLTLHFERAPFMSQERTVWLPWNSFYAMDTLVMKTEENSIPSCD 979 

IIIIIIIIIIIMIkHllllllilillHIIII MIIIMIIIIillllMIIIIII 

ITRQDGTFDLIANGGSALTLHFERAPFMSRERTVWPPWNSFYAMDTLVMKTEENSIPSCD 1011 
LSGFVRPDPIIISSPLSTFFSAAPGQNPIVPETQVIaHEEIELPGSNVKLRYLSSRTAGYK 1039 

llllllllllllillllllllM lllllllllllllllllhlllllllllllllll 

LSGFVRPDP 1 1 1 SS PLSTFFSAS PAANP I VPETQVLHEE I ELPGTNVKLRYLSSRTAG YK 1071 
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SLLK I TMT j^^PLNL I R VHLMVAVEGHL FQKS FQAS PNLASTF :^^WDAYGQRVYGLS 1099 

IlillllVlllilllNIIIIIIIIIMIIIMIII IIIWIIIIIIMIIM 

SLLK I TMTQSTVPLNL I R VHLMVAVEGHL FQKS FQAS PNLAYTF I WDKTDAYGQRVYGLS 1131 



DAWSVGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGE 1159 

IIIMIMIIIMIMIIIIIIIIIIIIIIIIIMIMIIIIIII MIIIMI IIIM 

DAWSVGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHTLNVKSGILLKGTGE 1191 
NQFLTQQPAI I TS I MGNGRRRS I SC PS CNGLAEGNKLLAP VALAVG I DGS L YVGDFNY I R 1219 

lllllllillllllMillllllllllllMIMIIIIIIIIMIMMIkllllMII 

NQFLTQQPAIITSIMGNGRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLFVGDFNYIR 1251 
R I FP SRNVTS I LELRNKE FKHSNNPAHKYYLAVDPVSGS L YVS DTNSRR I YRVKS LSGTK 1279 

IMIMIilMIIMIIIIIIihi lllillllikllilllllllMIMiilMI I 

RIFPSRNVTSILELRNKEFKHSNSPGHKYYLAVDPVTGSLYVSDTNSRRIYRVKSLSGAK 1311 
DLAGNSEWAGTGEQCLPFDEARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDATMIRKV 133 9 

llllllllllllllllllillllllllllhllllllllillllllllllilMIMIII 

DLAGNSEWAGTGEQCLPFDEARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDATMIRKV 1371 
DQNGIISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRI 1399 

IIMIIMIIIIIIIIIIillllMIIIMIillMIIIMIMIIIIIillllllllli 

DQNGI ISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRI 1431 

TENHQVS I lAGRPMHCQVPGIDYSLSKXXXXXXXXXXXXXXXXXTGVLYITETDEKKINR 1459 

llllllllilillllllllllllllll llllllllllllllll 
TENHQVSIIAGRPMHCQVPGIDYSLSKLAIHSALESASAIAISHTGVLYITETDEKKINR 1491 

LRQVTTNGE I CLLAGAASXXXXXXXXXXXX YSGDDAYATDA I LNS PSS LAVAPDGT lYIA 1519 

llllllllllllllllll llllllllllllllillllMIIIIMIII 

LRQVTTNGE X CLLAGAAS DCDCKND VNC I C YSGDDAYATDAI LNS PSS LAVAPDGT I Y I A 1551 

DLGN I R I RAVS KNKP VLNAFNQ YEAAS PGEQEL YVFNADG I HQ YTVSLVTGE YL YNFT YS 1579 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIII 
DLGN I R I RAVS KNKP VLNAFNQ YEAAS PGEQEL YVFNADG I HQ YTVSLVTGE YL YNFT YS 1611 

TDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKWSTQNLELGLM 163 9 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilllllllll IIIIIMIIII 
ADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLM 1671 

TYDGNTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGWTSLHREMEKS ITIDI ENSNR 1699 

llllllllllllllllillllllllllllMIIIIMIIIIIMIIIIMIklllllll 

TYDGNTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGWTSLHREMEKS ITVDIENSNR 1731 
DDDVTVITNLSSVEASYTWQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTIT 1759 

hlllllllllllllllllllillllllMkllMIMIIIIIMIIMIIillll^l 

DNDVTVITNLSSVEASYTWQDQVRNSYQLCSNGTLRVMYANGMGVSFHSEPHVLAGTLT 1791 

PTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKI 1819 

IIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIII 
PTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKI 1851 

YDDHRKFTLRI I YDQVGRPFLWLPSSGLAAVNVS YFFNGRLAGLQRGAMSERTDIDKQGR 1879 

iMIIIMMIIIIIMIIMMIIIIIIIMIIIillllllMllllilllllMIIII 

YDDHRKFTLRI I YDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGR 1911 
IVSRMFADGKVWSYSYLDKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSI 1939 

MIIIMIIIIIIilllllllllllMIIIIIIIIIIMM llllllllllllllllll 

IVSRMFADGKVWSYSYLDKSMVLLLQSQRQYIFEYDSSDRLHAVTMPSVARHSMSTHTSI 1971 
GYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTF 1999 

llllllllllllililllllilllllllllllllllllllllllllllllllllllllll 

GYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTF 2031 
GYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRI 2059 

lllllllllllllllllllllllllll^lilllllllllllllhlllllllllllllll 

GYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRFSEEGMINARFDYTYHDNSFRI 2091 
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lllllllll 

ASIKPVIS 



lUJI 



llllllllllllllllllllllllllll 

VDLYRYDEISGKVEHFGKFGVIYYDINQIIT 



Ullllllll 

.SKHFDTHGRI 2151 



KEVQYEMFRSLMYWMTVQYDSMGRVI KRELKLGPYANTTKYTYDYIX3DGQLQSVAVNDRP 2179 

lllllllllllillllllMlllllllllilllMIIIIIIIIIIIIIIIIIIIIIIIII 
KEVQYEMFRSl^YWMTVQYDSMGRVI KRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRP 2211 

TWRYSYDXXXXXXXXXXXXSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEY 2239 

lllllll I IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

TWRYS YDLNGNLHLLNPGNSARLMPLR YDLRDR ITRLGDVQYKIDDDGYLCQRGSDI FEY 2271 

NSKGLLTRAYNKASGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSN 2299 

llllllllllllillllilllllll IIIIIIIIMIIIIIIIIIIklMIIIIIIIM 

NSKGLLTRAYNKASGWSVQYRYDGVSRRASYKTNLGHHLQYFYSDLHHPTRITHVYNHSN 2331 
SE I TSLYY0LQGHLFAMESSSGEE YYVASDNTGTPLAVFS INGLMI KQLQYTAYGE I YYD 2359 

lilllllllllllMIIIIIIIIIIIIMIIIIMIIIMIMIIMIIMIIillllM 

SE I TSLYYDLQGHLFAMESSSGEE YYVASDNTGTPLAVFS INGLMI KQLQYTAYGE I YYD 2391 
SNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPFNLYMF 2419 

IIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII^MIIIIIIMIIII 

SNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWRNVGKEPAPFNLYMF 24 51 
KSNNPLSSELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENG 2479 

hlllllHIIIIIilllllillllliMIIIIIIMIIIIMIIIMIIIIIIIIIIII 

KNNNPLSNELDLKNYVTDVKSWLVMFGFQLSN 1 1 PGFPRAKMYFVPPPYELSESQASENG 2511 
QLITGVQQTTERHNQAFMALEGQVITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGR 2539 

llllllllllllllilMIIIIIMMII llllllllllllllllllllllllllll 

QLITGVQQTTERHNQAFLALEGQVISKKLHAGIREKAGHWFATTTPIIGKGIMFAIKEGR 2571 

VTTGVSSIASEDSRKVASVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGR 2599 

lllllllllllllllllllllllllllllllllllllllllllll^llllllllllllll 
VTTGVSSIASEDSRKVASVLNNAYYLDKMHYSIEGKDTHYFVKIGAADGDLVTLGTTIGR 2631 

KVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQAR 2659 

lllllllllllllllillllllllllllllllllllllllllllllllllllllllllll 

KVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQAR 2691 

QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS 2719 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS 2751 

SNIQFLRQNEMGKR 2733 

llllllllllllll 
SNIQFLRQNEMGKR 2765 



FCTR3F DOES NOT CONTAIN THESE AMINO ACIDS 



50 The amino acid sequences of the FCTRSbcde and f proteins were also found to have 

2536 of 2774 amino acid residues (91%) identical to, and 2558 of 2774 residues (91%) positive 
v^th, the 2764 amino acid residue protein Odd Oz/ten-m homolog 2 (Drosophila) (GenBank 
Acc:NP_035986.2) (SEQ ID NO:65), shown in Table 3U. 

Table 3U. BLAST? of FCTR3bcde and f against Odd Oz/ten-m homolog 2 (SEQ ID 
55 NO:65) 

> GI|76S7415|REF|NP 035986.21 ODD OZ/TEN-M HOMOLOG 2 (DROSOPHILA) ; ODD OZ/TEN-M HOMOLOG 
3 

(DROSOPHILA) [MUS MUSCULUS] 
GI |4760778|DBJ|BAA77397.1| (AB025411) TEN-M2 [MUS MUSCULUS) 
60 LENGTH =2764 



SCORE = 4996 BITS (12961), EXPECT =0.0 
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IDENTITIES = 2536/2774 (91%), POSITIVES = 2558/2774 (91% 
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MDVKDRRHx^BlGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETlBirDHDSRMHYGNR 

1 1 M 1 1 M I mi 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

MDVKDRRHRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNR 



lAPS = 51/2774 (1%) 



********* 



SFYDRIKLLAGKDSTHIIPGENPFNSSLVSLIRGQWTTDGTPLVGVNVSFVKYPKYGYT 

IMIIIMIMIIIIIIIIhlllMIIIIIMIIIII IIIIIIIIIIIIIIIIIIIM 

SFYDRIKLLAGKDSTHIIPGDNPFNSSLVSLIRGQWTMDGTPLVGVNVSFVKYPKYGYT 
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VTDLIHRESDEFPRQGTNFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 120 

IMklMIIII IIIIIIIIIMIIIIilMMIIIIIIIIIIIMIIIIIIIIIilll 

VTDLVHRE S DE FSRQGTNFTLAELG I CE PS PHRSG YCSDMG I LHQG YSLSTGS DADSDTE 120 
GGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTXXXXXXXXXXXXGRXXXXXXXXXXX 180 

IIMIIIIIIIMIIIMIIIIMIIIIIIIIIII M 

GGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDDNGRPIPPTSSSSLL 180 
XXXXXXXXHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACXXXXXXXXXXXX 240 

MMMillillllllllllllllllllllllllMMM 

PSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPP 240 
NHHSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQIHAPAPAPNDLATTPESVQ 300 

Mill IlillliMlillMIIII! 

NHHSQSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESVQ 3 00 

LQDSWVLNSNVPLETRHFLFKXXXXXXXXXXXXXXXYPLTSGTVYTPPPRLLPRNTFSRK 360 

lllllllllllllllllllll IIIIMIIIIIIIMIIIIIIIII 
LQDSWVLNSNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRK 360 

AFKLKKPSKYCSWKCXXXXXXXXXXXXXXXXXYFI 3 95 

lllllllllllllll III 

AFKLKKPSKYCSWKCAALSAIAAALLIAILIAYFIAMHLLGLNWQLQPADGHTFNNGVRT 420 

VPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLK 43 9 

llllllllllllllllllllllllllllllllllllllllllll 
GLPGNDDVATVPSGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLK 4 80 

FNISU3KDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVF 4 99 

llllllllllllllillllllllllllllllllllllllllllllllllllllllillll 
FNISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVF 540 

VQYLDVGLWHLAFYNDGKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGVCHCFPGFLGA 559 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIkllllllllll 

VQYLDVGLWHLAFYNDGKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGLCHCFPGFLGA 600 

DCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCSA 619 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIM 
DCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCAA 660 

GYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDT 679 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIk 

GYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDS 720 

GLCSCDPNWMGPDCSVEVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTC 739 

llllllllllllllll IIIIIIIIIMIIIIIIIIIIilllllllllllllllllllll 
GLCSCDPNWMGPDCS V - VCS VDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCI EHGTC 779 



799 



KDGKCECREGWNGEHCTIGRQTAGTETDGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 

llllllllllilllllll lllllllllllllllllllllllllllllllll 
KDGKCECREGWNGEHCT I DGCPDLCNGNGRCTLGQNS WQCVCQTGWRGPGC 830 

NVAMETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDI IQQGQTDWPAVK 859 

lllllllllllllllllllllllllllllllllillllllMIMIIIillliMIIIII 

NVAMETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDI IQQGQTDWPAVK 890 



919 



950 
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I TRQDGTFDL I ANGGAS LTLHFERAPFMSQERTVWLPWNS FYAMDTLVMKTEENS I PSCD 979 

lllllllllllllllllllll 11 I lllllllllllllll II I II I II II II IIIIIIM 
ITRQDGTFDLI ANGGSALTLHFERAP FMSQERTVWLPWNS FYAMDTLVMKTEENS I PSCD 1010 
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LSGFVRPDP 
lllllll 

lsgfvrpd: 



PLLIS 

• 



SSPLSTFFSAAPGQNPIVPETQVLHEEIELPGSj 

IMIIIIIkl IMIIMIIIMIIIIII 

SPLSTFFSASPASNPIVPETQVLHEEIELPG' 




RYLSSRTAGYK 1039 

lilllllllll 
RYLSSRTAGYK 1070 



S LLK I TMTQSTVPLNL I RVHLMVAVEGHLFQKS FQAS PNLASTF I WDKTDAYGQRVYGLS 1099 

IIIIIIIIMIMIIIIIIIIIIIIIIIIIIMIIIMIII IIIIIMIIIIIIIIIM 

SLLKITMTQSTVPLNLIRVHLMVAVEGHLFQKSFQASPNLAYTFIWDKTDAYGQRVYGLS 113 0 
DAWSVGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGE 1159 

IIIIIIIIIIMIIIIIIMIIIIIIMIIIIIilllllllllll IIIIIIIIIMIII 

DAWSVGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHTLNVKSGILHKGTGE 1190 
NQ FLTQQPAI I TS I MGNGRRRS I S C PSCNGLAEGNKLLAPVALAVG I DGSL YVGDFNY I R 1219 

IIIIMIIMIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIkllllllll 

NQFLTQQPAIITSIMGNGRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLFVGDFNYIR 1250 
RI FPSRNVTS I LELRNKEFKHSNNPAHKYYLAVOPVSGSLYVSDTNSRRI YRVKSLSGTK 1279 

IIIIIIIIIIIIIIIIIIIIIIM IIIIIIIIIMIIIIMilllMIIIIIIM I 

RIFPSRNVTSILELRNKEFKHSNSPGHKYYLAVDPVTGSLYVSDTNSRRIYRVKSLSGAK 1310 

DLAGNSEWAGTGEQCLPFDEARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDATMIRKV 1339 

IlillllMIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIMIIMIIIIIIMII 
DLAGNSEWAGTGEQCLPFDEARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDATMIRKV 1370 

DQNGIISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRI 1399 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 
DQNGIISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRI 1430 

TENHQVSIIAGRPMHCQVPGIDYSLSKXXXXXXXXXXXXXXXXXTGVLYITETDEKKINR 1459 

llllillllllllllilllllllllil IlilllllllllMII 

TENHQVSIIAGRPMHCQVPGIDYSLSKIiAIHSALESASAIAISHTGVLYITETDEKKINR 14 90 
LRQVTTNGEICLLAGAASXXXXXXXXXXXXYSGDDAYATDAILNSPSSLAVAPDGTIYIA 1519 

illlllllllllllllll illllllllllllllllllllllllMIII 

LRQVTTNGEICLLAGAASDCDCKNDVNCICYSGDDAYATDAIIiNSPSSLAVAPDGTIYIA 1550 
DLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYS 1579 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIMIIIIII 

DU3NIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYS 1610 

TDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKWSTQNLELGLM 163 9 

lllllllllillllilll II lllli MM I II II lilllllllll II IIIIIIIMII 
ADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLM 1670 

T YDGNTGLLATKS DETGWTTF YD YDHEGRLTNVTRPTG WTS LHREME KS ITIDIENSNR 1699 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 
TYDGNTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGWTSLHREMEKS ITIDIENSNR 1730 

DDDVTVITNLSSVEASYTVVQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTIT 1759 

IIIIIIIIIIMIIMIIIIIIMIIIIIIIIIIIIIIIIIIII HIIIIMMIIIII 

DDDVTVI TNLSS VEAS YTWQDQVRNS YQLCNNGTLRVMYANGMAVS FHSE PHVLAGT IT 1790 
PTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKI 1819 

IIMMIIMMMIMIIIIIIIIIIIM III MM IIIIIIIMII III II IIMMI 

PTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKI 1850 
YDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGR 1879 

IIIIIIIMI II II Mill II II III II IIMMI llllll lllllll Ml IIIIM III 

YDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGR 1910 
XVSRMFADGKWSYSYLDKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSI 193 9 

IIIMMIIIMMIMIIIIIIIIIIIIIIIIIIIIIIII IIIIMMIMMIIIII 

IVSRMFADGKVWSYSYLDKSMVLLLQSQRQYIFEYDSSDRLHAVTMPSVARHSMSTHTSI 1970 
GYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTF 1999 

llllllllll II lllllll lllllll II IIMIMIIIIII lllllll III MIIMMI 

GYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTF 2030 



IIMMI III IIMIIIIIIIII I llkll 1 1 Mill IIIIIIIMI MM Ml IIIIM I 
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203X gydettgvl: 



IjKMVNLQi 
1^PI>VD 



SGGFSCTIRYRKVGPLVDKQIYRFSEEGMI 



DYTYHDNSFRI 2090 



2060 ASIKPVIS^^PPVDLYRYDEISGKVEHFGKFGVIYYDINQIITT^WFTLSKHFDTHGRI 2119 

MllllllllilMIIMIIMIIIIillMIIIIMIIIMIillMIIIIIIIIIIII 

2091 ASIKPVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRI 2150 
2120 KEVQYEMFRSIJ^YWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYIX5IX3QI^SVAV^ 2179 

lllllllillllllllllllllllllllllllllllMIMIillllllllllllMIII 

2151 KEVQYEMFRSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRP 2210 
2180 TWRYSYDXXXXXXXXXXXXSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEY 2239 

lllllll I llllllllllllillllMIIIIIIIIIIIMIIIIIII 

2211 TWRYSYDLNGNLHLLNPGNSARLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEY 2270 
2240 NSKGLLTRAYNKASGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSN 2299 

IIIIIIMIIjlllllMlillllMMIIIIIMIIIIIIIIIIIIIIIIIIMMIII 

2271 NSKGLLTRAYNKASGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSN 2330 
2300 SEITSLYYDLQGHLFAMESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYD 2359 

MIIIIMIIIMIIIMIIMIIIIIIMIIIIIIIkllMIIIIIIIIIIIIIIIII 

2331 SEITSLYYDLQGHLFAMESSSGEEYYVASDNTGTPLAVYSINGLMIKQLQYTAYGEIYYD 2390 



2360 SNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPFNLYMF 2419 

IIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIMMHIIIIIIIIIIIII 

23 91 SNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWRNVGKEPAPFNLYMF 



2450 



2420 KSNNPLSSELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENG 2479 

l-^llllhlMIMIIIMIIIIIIIIMIMIIIIIIIIMMIIIIIIIIIIIIIIII 

2451 KNNNPLSNELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENG 2510 



24 80 QL I TGVQQTTERHNQAFMALEGQV I TKKLHAS I REKAGHW FATTTP 1 1 GKG I MFAI KEGR 

IIIIIIIIIIIIIIMkllllMIIIIIIMIIIIIIIIIIIIIIIIIilllllMMI 

2511 QLITGVQQTTERHNQAFLALEGQVXTKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGR 



2539 



2570 



2540 



VTTGVSSIASEDSRKVASVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGR 2599 

IIIIIIIIIIIMMIIIIIIMMIillllllllllllMMMHIIIIIIIIIIIIi 

2571 VTTGVSSIASEDSRKVASVLNNAYYLDKMHYSIEGKDTHYFVKIGAADGDLVTLGTTIGR 2630 



2600 KVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQAR 

IIIIIIIIMIIIIIIIIIMIIMIIIIMIIIIIIIIIIIMIIIIIIIilllllll 

KVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQAG 



2631 



2660 QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIMMIIIM 

2691 QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS 



2659 



2690 



2719 



2750 



QUERY: 2720 SNIQFLRQNEMGKR 2733 

llllllllllllll 
SBJCT: 2751 SNIQFLRQNEMGKR 2764 

* = FCTR3F DOES NOT CONTAIN THESE AMINO ACIDS 

FCTR3 is related to rat neurestin, a gene implicated in neuronal development (Otaki JM, 
Firestein S Dev Biol 1999 Aug 1;2 12(1): 165-81) Neurestin shows homology to human gamma- 
heregulin, a Drosophila receptor-type pair-rule gene product. Odd Oz (Odz) / Ten(m), and 
Ten(a). Neurestin has putative roles in synapse formation and brain morphogenesis. A mouse 
neurestin homolog, D0C4, has independently been isolated from the NIH-3T3 fibroblasts. 
D0C4 is also known as tenascin M (TNM), a Drosophila pair-rule gene homolog containing 
extracellular EGF-like repeats. The significant homology to these molecules and in particular, y- 
heregulin, have important implications regarding the potential contribution of FCTR3 to disease 
progression. Heregulin is the ligand for HER-2/ErbB2/NEU, a proto-oncogene receptor tyrosine 
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kinase implicated in br^^and prostate cancer progression that ^^originally identified in rat 
neuro/glioblastoma celNRs. Extopic expression of HER-2/Erb^^SfEU in MDA-MB-435 



breast adenocarcinoma cells confers chemoresistance to Taxol-induced apoptosis relative to 
vector transfected control cells (Yu et al. Overexpression of ErbB2 blocks Taxol-induced 
5 apoptosis by up-regulation of p21Cipl, which inhibits p34Cdc2 kinase. Molec. Cell 2: 581-591, 
1998). 

FCTR3 related tenascins and cancer biology 

As mentioned, FCTR3 also has significant homology to D0C4, (AKA tenascin M), a 
Drosophila pair-rule gene homolog containing extracellular EGF-like repeats. The tenascins are 
10 a growing family of extracellular matrix proteins that play prominent roles in tissue interactions 
critical to embryogenesis. Overexpression of tenascins has been described in multiple human 
solid malignancies. 

The role of the tenascin family of related proteins is to regulate epithelial-stromal 
interactions, participate in fibronectin-dependent cell attachment and interaction. Indeed, 

^[J 1 5 tenascin-C (TN) is overexpressed in the stroma of malignant ovarian tumours particularly at the 
interface between epithelia and stroma leading to suggestions that it may be involved in the 

□ process of invasion (Wilson et al (1996) Br J Cancer 74: 999-1004). Tenascin-C is considered a 

therapeutic target for certain malignant brain tumors (Gladson CL : J Neuropathol Exp Neurol 
1999 Oct;58(10):1029-40). Stromal or moderate to strong periductal Tenascin-C expression in 

E 

Q 20 DCIS (ductal carcinoma in situ) correlates with tumor cell invasion. (Jahkola et al. Eur J Cancer 
f=t 1998 Oct;34(l l):1687-92. Tenascin-C expression at the invasion border of early breast cancer is 

^jj a useful predictor of local and distant recurrence. Jahkola T, et al. Br J Cancer. 1998 

hi: Dec;78(l 1): 1507-13). Tenascin (TN) is an extracellular matrix protein found in areas of cell 

migration during development and expressed at high levels in migratory glioma cells. 
25 Treasurywala S, Berens ME Glia 1998 Oct;24(2):236-43 Migration arrest in glioma cells is 

dependent on the alphaV integrin subunit. Phillips OR, Krushel LA, Crossin KL J Cell Sci 1 998 
Apr;l 1 1 ( Pt 8): 1095 -104 Domains of tenascin involved in glioma migration. Finally, tenascin 
expression in hormone-dependent tissues of breast and endometrium indicate that Tenascin 
expression reflects malignant progression and is down-regulated by antiprogestins during 
30 terminal differentiation of rat mammary tumors (VoUmer et al. Cancer Res 1992 Sep 
l;52(l7):4642-8) 

Potential role of FCTR3 in oncologic disease progression: 
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T related molecules, FCTR3 



may play a role in one oBRore aspects of tumor cell biology thanmer the interactions of tumor 
epithelial cells with stromal components. In consideration, FCTR3 may play a role in the 
following malignant properties: 

Autocrine/paracrine stimulation of tumor cell proliferation 
Autocrine/paracrine stimulation of tumor cell survival and tumor cell resistance to 
cytotoxic therapy 

Local tissue remodeling, paranechmal and basement membrane invasion and motility of 
tumor cells thereby contributing to metastasis. 

Tumor-mediated immunosuppression of T-cell mediated immune effector cells and 
pathways resulting in tumor escape from immune surveilance. 

Therapeutic intervention targeting FCTR3 in oncologic and central nervous system 



Predicted disease indications from expression profiling in 41 normal human tissues and 
55 human cancer cell lines (see Example 2) include a subset of human gliomas, astrocytomas, 
mixed glioma/astrocytomas, renal cells carcinoma, breast adenocarcinoma, ovarian cancer, 
melanomas. Targeting of FCTR3 by human or humanized monoclonal antibodies designed to 
disrupt predicted interactions of FCTR3 with its cognate ligand may result in significant anti- 
tumor/anti-metastatic activity and the amelioration of associated symptomatology. 
Identification of small molecules that specifically/selectively interfere with dovrastream 
signaling components engaged by FCTR3/ligandinteractions would also be expected to result in 
significant anti-tumor/anti -metastatic activity and the amelioration of associated 
symptomatology. Likewise, modified antisense ribonucleotides or antisense gene expression 
constructs (plasmids, adenovirus, adeno-associated viruses, "naked" DNA approaches) designed 
to diminish the expression of FCTR3 transcripts/messenger RNA (mRNA) would be anticipated 
based on predicted properties of FCTR3 to have anti-tumor impact. 

Based on the relatedness to neurestin and heregulins, as well as its high level expression 
in brain tissue, FCTR3 may also be used for remyelination in order to promote 
regeneration/repair/remyleination of injured central nervous system cells resulting from 
ischemia, brain trauma and various neurodegenerative diseases.. This postulate is based on 
reports indicating that neuregulin, glial grov^h factor 2, diminishes autoinunune demyelination 
and enhances remyelination in a chronic relapsing model for multiple sclerosis (Cannella et al. . 



indications: 
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Proc. Nat. Acad. Sci. 9^^100-10105, 1998). The expression related molecule neurestin 
can be induced in exterwRufted cells during regeneration of ol^^ry sensory neurons. 



FCTR4 

FCTR4 is a plasma membrane protein related to NF-Kappa-B P65delta3 protein. The 
5 clone is expressed in fetal liver tissues. 

The novel FCTR4 nucleic acid of 609 nucleotides (also referred to as 29692275.0.1) is 
shown in Table 4A. An ORF begins v^th an ATG initiation codon at nucleotides 99-101 and 
ends with a TAA codon at nucleotides 522-524. A putative untranslated region upstream from 
the initiation codon and downstream from the termination codon is underlined in Table 4A, and 
10 the start and stop codons are in bold letters. 

Table 4A. FCTR4 Nucleotide Sequence (SEQ ID NO: 14) 

CTGACATACTATATTAGTTGTTTGTTCACTGTCTCCACTCCAGCTAGAATATAAGTTCCATAGGGCAGAGTTTTTGTTCA 
CTGCTATATTTTATAAGCA TGAATGAATGCATGAACGAATGGACTGATAACCCACAAGCCAAAGACCTCCATGACCTGCC 
ACTGCCCTCCTTTCATTTTATTCTCACCTCTACCAATACTAAATCACCTAGTTATGTAAATACGATATGCACTTTCATGG 
15 CCCCTTGCTTTGTCATATGCTGTTCCCTTTGCCTGGAATATAAACTCTCAAAATACCATCCACATTTTAAAATCTTCTCC 
AGAAAGCTTCCTCTGTCCACCCCCACCCTCCCACCCCCATATAGAGTAAGTCAGTCTTTCCTTTGTGCTACATTTGTACC 
TGTATCTACAGTGGCTCTAATCAAACTGCACTGTGTCTCTCACTTCCTAGATTGTGAACTCTTTGAGGCTGAAGACTACT 
TATTCATCTCTTTACCTCCAATGCCTAGGACAGGACCTTCATA AAGCAACTACTCTATAAATGTTGAAACATATGCATGA 
CTATTCTGTAACAGGAATGAAAATATGGCATTTCAAGAAGTCACTACTC 

20 

The FCTR4 protein encoded by SEQ ID NO: 14 has 141 amino acid residues and is 
presented using the one-letter code in Table 4B. The Psort profile for FCTR4 predicts that this 
sequence has no N-terminal signal peptide and is likely to be localized at the plasma membrane 
wdth a certainty of 0.6000. The most likely cleavage site for a peptide is between amino acids 39 
25 and 40, i.e,, at the dash in the amino acid sequence ACT-CCA, based on the Signal? result. The 
predicted molecular weight of this protein is 16051 .5 Daltons. 

Table 4B. Encoded FCTR4 protein sequence (SEQ ID NO:15). 

MNECMNEWTDNPQAKDLHDLPLPSFHFILTSTNTKSPSYVNTICTFMAPCFVICCSLCLEYKLSKYHPHFKIFSRKLPLSTPTLPP 
PYRVSQSFLCATFVPVSTVALIKLHCVSHFLDCELFEAEDYLFISLPPMPRTGPS 

30 

The predicted amino acid sequence was searched in the publicly available GenBank 
database FCTR4 protein showed 30 % identities (22 over 72 amino acids) and 43% homologies 
(3 lover 72 amino acids) with hypothetical 10 kD protein of Trypanosoma cruzi (86 aa; 
ACC:Q99233) shown in Table 4C. The best homologies with a human protein were 54 % 
35 identities (114 over 343 amino acids) with NF-Kappa-B P65delta3 protein (71 aa fragment; 
ACC:Q13313) (SEQ ID NO:77). 

Table 4C. BLAST? of FCTR4 against protein sequences 

BLAST X search results are shown below: 

ptnr:SPTREMBL-ACC:Q99233 HYPOTHETICAL 10 KD PROTEIN +3, 68, 0.60, 1, (SEQ ID 

92 15966-697 
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: JJ^6896 GABA RECEPTOR SUBUNI-^^EI 



NO:73) 

ptnr:SPTREMBL-ACC^6896 GABA RECEPTOR SUBUNlT^^DES +3, 66, 0.81, 4 (SEQ 
IDNO:74) 

ptnr:SPTREMBL-ACC:076473 GABA RECEPTOR SUBUNIT - LEPTI... +3, 66, 0.99, 2 (SEQ 
ID NO:75) 

ptnr:TREMBLNEW-ACC:AAD28317 F13J11.13 PROTEIN - Arabid... +3, 62, 0.99, 1 (SEQ ID 
NO:76) 

Based upon homology, FCTR4 proteins and each homologous protein or peptide may 
share at least some activity. 



FCTR5 

FCTR5 is a protein bearing sequence homology to human complement CIR component 
precursor. The clone is expressed in breast, heart, lung, fetal lung, salivary gland, adrenal gland, 
spleen, kidney, and fetal kidney. 

The novel FCTR5 nucleic acid of 1667 nucleotides (also referred to as 32125243.0.21) is 
shown in Table 5A. An ORF begins with an ATG initiation codon at nucleotides 34-36 and ends 
with a TGA codon at nucleotides 1495-1497. A putative untranslated region upstream from the 
initiation codon and downstream from the termination codon is underlined in Table 5 A, and the 
start and stop codons are in bold letters. 

Table 5A. FCTRSa Nucleotide Sequence (SEQ ID NO: 16) 

GTTCTCTCGCAGGTCCCAGATGTCCAGTTCCAG ATGCCTGGACCCAGAGTGTGGGGGAAATATCTCTGGAGAAGCCCTCA 

CTCCAAAGGCTGTCCAGGCGCAATGTGGTGGCTGCTTCTCTGGGGAGTCCTCCAGGCTTGCCCAACCCGGGGCTCCGTCC 

TCTTGGCCCAAGAGCTACCCCAGCAGCTGACATCCCCCGGGTACCCAGAGCCGTATGGCAAAGGCCAAGAGAGCAGCACG 

GACATCAAGGCTCCAGAGGGCTTTGCTGTGAGGCTCGTCTTCCAGGACTTCGACCTGGAGCCGTCCCAGGACTGTGCAGG 

GGACTCTGTCACAATCTCATTCGTCGGTTCGGATCCAAGCCAGTTCTGTGGTCAGCAAGGCTCCCCTCTGGGCAGGCCCC 

CTGGTCAGAGGGAGTTTGTATCCTCAGGGAGGAGTTTGCGGCTGACCTTCCGCACACAGCCTTCCTCGGAGAACAAGACT 

GCCCACCTCCACAAGGGCTTCCTGGCCCTCTACCAAACCGTGGCTGTGAACTATAGTCT^GCCCATmGCGAGGCCAG^^ 

GGGCTCTGAGGCCATCAACGCACCTGGAGACAACCCTGCCAAGGTCCAGAACCACTGCCAGGAGCCCTATTATCAGGCCG 

OXSCAGCAGGGGCACTCACCTGTGCAACCCCAGGGACCTGGAAAGACAGACAGGATGGGGAGGAGGTTCTTCAG 

CCTGTCTGCGGACGGCCAGTCACCCCCATTGCCCAGAATCAGACGACCCTCGGTTCTTCCAGAGCCAAGCTGGGCAACTT 

CCCCTGGCAAGCCTTCACCAGTATCCACGGCCGTGGGGGCGGGGCCCTGCTGGGGGACAGATGGATCCTCACTGCTGCCC 

ACACCATCTACCCCAAGGACAGTGTTTCTCTCAGGAAGAACCAGAGTGTGAATGTGTTCTTGGGCCACACAGCCATAGAT 

GAGATGCTGAAACTGGGGAACCACCCTGTCCACCGTGTCGTTGTGCACCCCGACTACCGTCAGAATGAGTCCCATAACTT 

TAGCGGGGACATCGCCCTCCTGGAGCTGCAGCACAGCATCCCCCTGGGCCCCAACGTCCTCCCGGTCTGTCTGCCCGATA 

ATGAGACCCTCTACCGCAGCGGCTTGTTGGGCTACGTCAGTGGGTTTGGCATGGAGATGGGCTGGCTAACTACTGAGCTG 

AAGTACTCGAGGCTGCCTGTAGCTCCCAGGGAGGCCTGCAACGCCTGGCTCCAAAAGAGACAGAGACCCGAGGTGTTTTC 

TGACTWVTATGTTCTGTGTTGGGGATGAGACGCAAAGGCACAGTGTCTGCCAGGGGGACAGTGGCAGCCTCTATC 

GGGACAATCATGCCCATCACTGGGTGGCCACGGGCATTGTGTCCTGGGGCATAGGGTGTGGCGAAGGGTA 

ACCAAGGTGCTCAGCTATGTGGACTGGATCAAGGGAGTGATGAATGGCAAGAATTG ACCCTGGGGGCTTGAACAGGGACT 

GACCAGCTVCAGTGGAGGCCCCAGGCAACAGAGGGCCTGGAGTGAGGACTGAACACTGGGGTAGGGGGT^ 

TGCAGTGGCTTGGTGCAACAGTGATGTG/^TAGGATTTCCCTTTTTTTTTTTTTTTTTAAAAAAAAA 
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The FCTR5 pro^ encoded by SEQ ID NO: 16 has 487^uio acid residues, and is 
presented using the onSRer code in Table 5B. FCTR5 was seaMed against other databases 
using SignalPep and PSort search protocols. The FCTR5 protein is most likely microbody 
(peroxisome) (Certainty=0.6406) and seems to have no N-terminal signal sequence. The 
predicted molecular weight of FCTR5 protein is 5351 1.9 daltons. 



Table 5B. Encoded FCTRSa protein sequence (SEQ ID NO: 17). 

MPGPRWGKYLWRSPHSKGCPGAMWWLLLWGVLQACPTRGSVLLAQELPQQLTSPGYPEPYGKGQESSTDIKAPEGFAVRLVFQDF 

DLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLGRPPGQREFVSSGRSLRLTFRTQPSSENKTAHLHKGFLALYQTVAVNYSQPIS 

EASRGSEAINAPGDNPAKVQNHCQEPYYQAAAAGALTCATPGTWKDRQDGEEVLQCMPVCGRPVTPIAQNQTTLGSSRAKLGNFPW 

QAFTSIHGRGGGALLGDRWILTAAHTIYPKDSVSLRKNQSVNVFLGHTAIDEMLKLGNHPVHRVVVHPDYRQNESHNFSGDI^ 

LQHSIPLGPNVLPVCLPDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMFCVGDE^ 

VCQGDSGS L YWWDNHAHHWVATG I VS WG I GCGEG YDF YTKVLS YVDW I KG VMNGKN 



An alternative embodiment, FCTR5b, is a 1691 base sequence shown in Table 5C. 
Table SC. FCTRSb Nucleotide Sequence (SEQ ID NO: 18) 

TTTTTTTTTAAAAAAAAAAAAAAAAAGGGAAATCCTATTCACATCACTGTTGCACCAAGCCACTGCAAGAGA^ 

ACCCCAGTGTTCAGTCCTCACTCCAGGCCCTCTGTTGCCTGGGGCCTCCACTGTGCTGGTCAGTCCCTGTTCAAGCCCCCAGGGTC 

AATTCTTGCCATTCATCACTCCCTTGATCCAGTCCACATAGCTGAGCACCTTGGTGTAGAAGTCATACCCTTCGCCAC^ 

CCCCAGGACACAATGCCCGTGGCCACCCAGTGATGGGCATGATTGTCCCATACCACATAGAGGCTGCCACTGTCCCCCTG^ 

ACTGTGCCTTTGCGTCTCATCCCCAACACAGAACATATTGTCAGAAAAC7VCCTCGGGTCTCTGTCTCTTTTGGAGCCAGGCGTTGC 

AGGCCTCCCTGGGAGCTACAGGCAGCCTCGAGTACTTCAGCTCAGTAGTTAGCCAGCCCT^TCTCCATGCCAAACCC^ 

CCCAACAAGCCGCTGCGGTAGAGGGTCTCATTATCGGGCAGACT^GACCGGGAGGACGTTGGGGCCCAGGGGGATGCTGTGCTGCAG 

CTCCAGGAGGGCGATGTCCCCGCTAAAGTTATGGGACTCT^TTCTGACGGTAGTCGGGGTGCACAACGACACGGTGGACA 

TCCCCAGTTTCAGCATCTCATCTATGGCTGTGTGGCCCAAGAACACATTCACACTCTGGTTCTTCCTGAGAGAA^ 

GGGTAGATGGTGTGGGCAGCAGTGAGGATCCATCTGTCCCCCAGCAGGGCCCCGCCCCCACGGCCGTGGATACTGGTGAAGGCTTG 

CCAGGGGAAGTTGCCCAGCTTGGCTCTGGAAGAACCGAGGGTCGTCTGATTCTGGGCAATGGGGGTGACTGGCCGTCCGCTVGAC^ 

GCATACACTGAAGAACCTCCTCCCCATCCTGTCTGTCTTTCCAGGTCCCTGGGGTTGCACAGGTGAGTGCCCCTGCTGCCGC^ 

TGATAATAGGGCTCCTGGCAGTGGTTCTGGACCTTGGCAGGGTTGTCTCCAGGTGCGTTGATGGCCTCAGAGCCCCTGCTGGCCTC 

GCTGATGGGCTGACTATAGTTCACAGCCACGGTTTGGTAGAGGGCCAGGAAGCCCTTGTGGAGGTGGGCAGTCTTGTTCTCCGAGG 

AAGGCTGTGTGCGGAAGGTCAGCCGCAAACTCCTCCCTGAGGATACAAACTCCCTCTGACCTIGGGGGCCTGCCCAGAGGGGAGCCT 

TGCTGACCACAGAACTGGCTTGGATCCGAACCGACGAATGAGATTGTGACAGAGTCCCCTGCACAGTCCTGGGACGGCTCCAG^ 

GAAGTCCTGGAAGACGAGCCTCACAGCAAAGCCCTCTGGAGCCTTGATGTCCGTGCTGCTCTCTTGGCCTTTGCCATACGGCTCTG 

GGTACCCGGGGGATGTCAGCTGCTGGGGTAGCTCTTGGGCCAAGAGGACGGAGCCCCGGGTTGGGCAAGCCTGGAGGACTCCCC^ 

AGAAGCAGCCACCACATTGCGCCTGGACAGCCTTTGGAGTGAGGGCTTCTCCAGAGATATTTCCCCCACACTCTGGGTCC^ 

CTGGAACTGGACATCTGGGACCTGCGAGAGAACTGGCCCAGGATAGGGAACAAAAGG 



The FCTR5b protein encoded by SEQ ID NO: 18 has 487 amino acid residues, and is 
presented using the one-letter code in Table 5D. FCTR5 was searched against other databases 
using SignalPep and PSort search protocols. The FCTR5b protein is most likely microbody 
(peroxisome) (Certainty=0.6406) and seems to have no N-terminal signal sequence. The 
predicted molecular weight of FCTR5 protein is 5351 1.9 daltons. 

Table 5D. Encoded FCTRSb protein sequence (SEQ ID NO:19). 

MPGPRVWGKYLWRSPHSKGCPGAMWWLLLWGVLQACPTRGSVLLAQQLPQQLTSPGYPEPYGKGQESSTDIKAPEGFAVRLVFQDF 
DLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPIX3RPPGQREFVSSGRSLRLTFRTQPSSENKTAHIiHKGFL»ALYQWAV^r^ 
EASRGSEAINAPGDNPAKVQNHCQEPYYQAAAAGALTCATPGTWKDRQDGEEVI^CMPVCGRPVTPIAQNQTTLGSSRAi^ 
QAFTSIHGRGGGALLGDRWILTAAHTIYPKDSVSLRKNQSVNVFLGHTAIDEMLKLGNHPVHRVW 

LQHSIPLGPbm.PVCLPDNETLYT^SGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMFCVGDETO 
VCQGDSGSLYVVWDNHAHHWATGIVSWGIGCGEGYDFYTKVLSYVDWIKGVMNGKN 
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The predicted aij^ap acid sequence was searched in the Q^kcly available GenBank 
database FCTRSa protSRiowed 58 % identities (177 over 302^mino acids) and 74 % 
homologies (226 over 302 amino acids) with human complement CIR component precursor 
(EC 3.4.21.41) (705 aa.; ACC:P00736). Based upon homology, FCTR5 proteins and each 
homologous protein or peptide may share at least some activity. 

In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence the nucleotides 17-1594 of FCTR5a have 1575 of 1578 bases (99 %) identical to Homo 
sapiens complement Clr-like proteinase precursor (GENBANK-ID: XM_007061 .1) (SEQ ID 
NO:78) (Table 5E). 

Table 5E. BLASTN of FCTRSa against Homo sapiens complement Clr-like proteinase 

precursor (SEQ ID NO:78) 

>GI 1 11436767 |REF|XM_007061.1 I HOMO SAPIENS COMPLEMENT CIR-LIKE PROTEINASE PRECURSOR, 
{L0C51279) , 

MRNA 
LENGTH = 3318 



SCORE = 3104 BITS (1566), EXPECT = 0.0 
IDENTITIES = 1575/1578 (99%) 
STRAND = PLUS / PLUS 



CAGATGTCCAGTTCCAGATGCCTGGACCCAGAGTGTGGGGGAAATATCTCTGGAGAAGCC 7 6 

IIIIIMIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIII 

CAGATGTCCAGTTCCAGATGCCTGGACCCAGAGTGTGGGGGAAATATCTCTGGAGAAGCC 6 0 
CTCACTCCAAAGGCTGTCCAGGCGCAATGTGGTGGCTGCTTCTCTGGGGAGTCCTCCAGG 13 6 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIM 

CTCACTCCAAAGGCTGTCCAGGCGCAATGTGGTGGCTGCTTCTCTGGGGAGTCCTCCAGG 120 
CTTGCCCAACCCGGGGCTCCGTCCTCTTGGCCCAAGAGCTACCCCAGCAGCTGACATCCC 196 

llllilllllllllllllllllllllllllllillllllllllllllllllllMlllii 

CTTGCCCAACCCGGGGCTCCGTCCTCTTGGCCCAAGAGCTACCCCAGCAGCTGACATCCC 180 
CCGGGTACCCAGAGCCGTATGGCAAAGGCCAAGAGAGCAGCACGGACATCAAGGCTCCAG 256 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIillll 

CCGGGTACCCAGAGCCGTATGGCAAAGGCCAAGAGAGCAGCACGGACATCAAGGCTCCAG 240 



QUERY : 


17 


SB JCT : 


1 


QUERY : 


77 


SB JCT : 


61 


QUERY : 


137 


SB JCT : 


121 


QUERY : 


197 


SB JCT : 


181 


QUERY: 


257 


SBJCT : 


241 


QUERY: 


317 


SBJCT : 


301 


QUERY: 


377 


SBJCT : 


361 


QUERY: 


437 


SBJCT : 


421 


QUERY: 


497 


SBJCT : 


481 


QUERY : 


557 



AGGGCTTTGCTGTGAGGCTCGTCTTCCAGGACTTCGACCTGGAGCCGTCCCAGGACTGTG 

lllllllllliMIIIIMIIIIIIMIIIIIIIIIIMIIIIIIIMIIIIMIIIIM 

AGGGCTTTGCTGTGAGGCTCGTCTTCCAGGACTTCGACCTGGAGCCGTCCCAGGACTGTG 



316 



300 



376 



CAGGGGACTCTGTCACAATCTCATTCGTCGGTTCGGATCCAAGCCAGTTCTGTGGTCAGC 

lllllllllliMlilllllllillllllllllllllillMIMIIIIIIIIIIIIIil 

CAGGGGACTCTGTCACAATCTCATTCGTCGGTTCGGATCCAAGCCAGTTCTGTGGTCAGC 360 
AAGGCTCCCCTCTGGGCAGGCCCCCTGGTCAGAGGGAGTTTGTATCCTCAGGGAGGAGTT 436 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMMMIIIIIIIIIIIIilil 

AAGGCTCCCCTCTGGGCAGGCCCCCTGGTCAGAGGGAGTTTGTATCCTCAGGGAGGAGTT 420 
TGCGGCTGACCTTCCGCACACAGCCTTCCTCGGAGAACAAGACTGCCCACCTCCACAAGG 496 

IIIIIMIIIIIIMMIIIIIIMIIMIMIIIIMIIIIIIIIIMIIIMIIIIII 

TGCGGCTGACCTTCCGCACACAGCCTTCCTCGGAGAACAAGACTGCCCACCTCCACAAGG 4 80 
GCTTCCTGGCCCTCTACCAAACCGTGGCTGTGAACTATAGTCAGCCCATCAGCGAGGCCA 556 

IIIIIMIIMIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIilllllllllllllll 

GCTTCCTGGCCCTCTACCAAACCGTGGCTGTGAACTATAGTCAGCCCATCAGCGAGGCCA 540 
GCAGGGGCTCTGAGGCCATCAACGCACCTGGAGACAACCCTGCCAAGGTCCAGAACCACT 616 

lllllllllllllllllllllillllllllllllllllllllllllllllllllllllll 
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10 



15 



20 



25 



m 

V-.i 



35 



f=l 40 



in 

r=i 45 



50 



55 



60 



65 



70 



SB JCT : 
QUERY : 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY : 
SB JCT : 
QUERY: 
SB JCT : 
QUERY : 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 
QUERY : 
SB JCT : 
QUERY : 
SB JCT : 
QUERY : 
SB JCT : 
QUERY : 
SB JCT : 
QUERY: 
SB JCT : 
QUERY: 
SB JCT : 



541 GCAGGGGCTCTGAGGCCATCAACGCACCTGGAGACAACCCTGCCAAfiGTCCAGAACCACT 600 



617 GCCAGGAGi^BvTTATCAGGCCGCGGCAGCAGGGGCACTCACC'I^PCAACCCCAGGGA 676 

llllilliimMilllilMMIIIIIIIIIIIIIIIIIIMIIIIIIiliMMIII 

601 GCCAGGAGCCCTATTATCAGGCCGCGGCAGCAGGGGCACTCACCTGTGCAACCCCAGGGA 660 
677 CCTGGAAAGACAGACAGGATGGGGAGGAGGTTCTTCAGTGTATGCCTGTCTGCGGACGGC 736 

IIIIIIIIIMIIIIIIIIIIIIMIIIIillllMIIIIMMIMIIIIIIMMIII 

661 CCTGGAAAGACAGACAGGATGGGGAGGAGGTTCTTCAGTGTATGCCTGTCTGCGGACGGC 720 

73 7 CAGTCACCCCCATTGCCCAGAATCAGACGACCCTCGGTTCTTCCAGAGCCAAGCTGGGCA 796 

llllllllillllllllllllMlllllllllllilllllllllllllllMIIIIIIII 
721 CAGTCACCCCCATTGCCCAGAATCAGACGACCCTCGGTTCTTCCAGAGCCAAGCTGGGCA 780 

797 ACTTCCCCTGGCAAGCCTTCACCAGTATCCACGGCCGTGGGGGCGGGGCCCTGCTGGGGG 856 

IIIIIIIIIMIIIIIMMMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIM 

781 ACTTCCCCTGGCAAGCCTTCACCAGTATCCACGGCCGTGGGGGCGGGGCCCTGCTGGGGG 840 
857 ACAGATGGATCCTCACTGCTGCCCACACCATCTACCCCAAGGACAGTGTTTCTCTCAGGA 916 

IIIIIIMIIIIIIIIIMIIMIMIII MlillllllllllllllllMMIIIIM 

841 ACAGATGGATCCTCACTGCTGCCCACACCGTCTACCCCAAGGACAGTGTTTCTCTCAGGA 900 



AA^TCCP 



917 



901 



977 



AGAACCAGAGTGTGAATGTGTTCTTGGGCCACACAGCCATAGATGAGATGCTGAAACTGG 

IIIMIMIIIIIIMMIIMIIIMIIIIMIIIIIIIIIIIIIIIIIMMIIIIM 

AGAACCAGAGTGTGAATGTGTTCTTGGGCCACACAGCCATAGATGAGATGCTGAAACTGG 



976 



960 



GGAACCACCCTGTCCACCGTGTCGTTGTGCACCCCGACTACCGTCAGAATGAGTCCCATA 103 6 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
961 GGAACCACCCTGTCCACCGTGTCGTTGTGCACCCCGACTACCGTCAGAATGAGTCCCATA 1020 

1037 ACTTTAGCGGGGACATCGCCCTCCTGGAGCTGCAGCACAGCATCCCCCTGGGCCCCAACG 1096 

IIIIIIIIIMIIIMIIIMIIIIillllllllllllllllllllllllllllllllll 

1021 ACTTTAGCGGGGACATCGCCCTCCTGGAGCTGCAGCACAGCATCCCCCTGGGCCCCAACG 1080 

1097 TCCTCCCGGTCTGTCTGCCCGATAATGAGACCCTCTACCGCAGCGGCTTGTTGGGCTACG 1156 

llllllllililllllllllllllllllllllllllllllllllllllllllllllllll 
1081 TCCTCCCGGTCTGTCTGCCCGATAATGAGACCCTCTACCGCAGCGGCTTGTTGGGCTACG 1140 

1157 TCAGTGGGTTTGGCATGGAGATGGGCTGGCTAACTACTGAGCTGAAGTACTCGAGGCTGC 1216 

llllllllllilllllllllllllllllllllllllllllllllllllllllllllllll 
1141 TCAGTGGGTTTGGCATGGAGATGGGCTGGCTAACTACTGAGCTGAAGTACTCGAGGCTGC 1200 

1217 CTGTAGCTCCCAGGGAGGCCTGCAACGCCTGGCTCCAAAAGAGACAGAGACCCGAGGTGT 1276 

Mllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
1201 CTGTAGCTCCCAGGGAGGCCTGCAACGCCTGGCTCCAAAAGAGACAGAGACCCGAGGTGT 12 60 

1277 TTTCTGACAATATGTTCTGTGTTGGGGATGAGACGCAAAGGCACAGTGTCTGCCAGGGGG 1336 

IIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilllMIIIIIII 
1261 TTTCTGACAATATGTTCTGTGTTGGGGATGAGACGCAAAGGCACAGTGTCTGCCAGGGGG 1320 

1337 ACAGTGGCAGCCTCTATGTGGTATGGGACAATCATGCCCATCACTGGGTGGCCACGGGCA 13 96 

Illllllllll llllllllllllllllllllllllllllllllllllllllllllllll 
1321 ACAGTGGCAGCGTCTATGTGGTATGGGACAATCATGCCCATCACTGGGTGGCCACGGGCA 1380 

1397 TTGTGTCCTGGGGCATAGGGTGTGGCGAAGGGTATGACTTCTACACCAAGGTGCTCAGCT 1456 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
13 81 TTGTGTCCTGGGGCATAGGGTGTGGCGAAGGGTATGACTTCTACACCAAGGTGCTCAGCT 1440 

1457 ATGTGGACTGGATCAAGGGAGTGATGAATGGCAAGAATTGACCCTGGGGGCTTGAACAGG 1516 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
1441 ATGTGGACTGGATCAAGGGAGTGATGAATGGCAAGAATTGACCCTGGGGGCTTGAACAGG 1500 

1517 GACTGACCAGCACAGTGGAGGCCCCAGGCAACAGAGGGCCTGGAGTGAGGACTGAACACT 1576 

lllllllllllllllllllllllllllllllllllllllllllillllllllllllllll 

1501 GACTGACCAGCACAGTGGAGGCCCCAGGCAACAGAGGGCCTGGAGTGAGGACTGAACACT 1560 

1577 GGGGTAGGGGGTGGGGGT 1594 

llllllllil IIIIIM 
1561 GGGGTAGGGGTTGGGGGT 1578 
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In this search it^s also found that the FCTRSa nucleic j«d had homology to three 
fragments of Homo sa^KK complement component 1, r subcomj^ent. It has 102 of 1 17 bases 
(87%) identical to 1458-1574, 82 of 94 bases (87%) identical to 2052-2145, and 54 of 63 bases 
(85%) identical to 1678-1740 all fragments of Homo sapiens complement component 1, r 
5 subcomponent (GenBank Acc: NM_001733.1) (Table 5F). 

Table 5F. BLASTN of FCTRSa against Homo sapiens complement component 1, r 

subcomponent (SEQ ID NO:79) 

>Glt4502492|REF(NM_001733.l| HOMO SAPIENS COMPLEMENT COMPONENT 1, R SUBCOMPONENT 
(CIR) , MRNA 
10 LENGTH = 2386 

SCORE = 113 BITS (57), EXPECT = 3E-22 
IDENTITIES = 102/117 (87%) 
STRAND = PLUS / PLUS 

15 

QUERY: 783 

lllllll lllllllllllllllllll I Mlllll IIIMIII II IIMIIII 

AGCCAAGATGGGCAACTTCCCCTGGCAGGTGTTCACCAACATCCACGGGCGCGGGGGCGG 
GGCCCTGCTGGGGGACAGATGGATCCTCACTGCTGCCCACACCATCTACCCCAAGGA 8 9! 

□ I III II II II II III I II I II Mill I I III IIMIIII I II II II Ml I 

; 5 c 
■sir 

p IDENTITIES = 82/94 (87%) 

STRAND = PLUS / PLUS 

QUERY : 13 80 CTGGGTGGCCACGGGCATTGTGTCCTGGGGCATAGGGTGTGGCGAAGGGTATGACTTCTA 

llllllllllllllllll IIIIMIIIIIIII Mill II II MM MUM 

CTGGGTGGCCACGGGCATCGTGTCCTGGGGCATCGGGTGCAGCAGGGGCTATGGCTTCTA 

CACCAAGGTGCTCAGCTATGTGGACTGGATCAAG 1473 
llllll lllllll III lllllllllllllll 





QUERY : 


783 




SB JCT : 


1458 


20 


QUERY : 


843 




SB JCT : 


1518 


25 


SCORE 


= 91. 





30 


QUERY : 


1380 






SB JCT : 


2052 


= = 1 
r^r 




QUERY: 


1440 


f=i 


35 


SB JCT : 


2112 



SCORE =54.0 BITS (27), EXPECT = 2E-04 
IDENTITIES = 54/63 (85%) 
40 STRAND = PLUS / PLUS 



QUERY: 


1006 


SB JCT : 


1678 


QUERY: 


1066 


SB JCT : 


1738 



mil iiiiiiiiiiii IIMIIII I II III iiiiiiiiiiiiii llllll 



II 



50 The amino acid sequence of the protein of FCTRSa 485 of 487 amino acid residues 

(99%) identical to, and 487 of 487 residues (100%) positive with, the 487 amino acid 
complement Clr-like proteinase precursor from Homo sapiens (GenBank- ACC: AAF44349.1) 
(SEQ ID NO:80) (Table 5G). 
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like proteinase precursor 



Table 5G. BLASTP q^^fTRSa and b against Complement 

(SEQ ID NO:80) 

>GI|7706083 |REF|NP_057630.l| COMPLEMENT CIR-LIKE PROTEINASE PRECURSOR, [HOMO SAPIENS] 
GI|11436768|REF|XP_007061.1| COMPLEMENT CIR-LIKE PROTEINASE PRECURSOR, [HOMO SAPIENS] 
GI|7271475|GB|AAF44349.1|AF178985_1 (AF178985) COMPLEMENT CIR-LIKE PROTEINASE 

PRECURSOR [HOMO SAPIENS] 
LENGTH = 487 

SCORE = 972 BITS (2513), EXPECT =0.0 

IDENTITIES = 485/487 (99%), POSITIVES = 487/487 (100%) 

R 

MPGPRVWGKYLWRSPHSKGCPGAMWWLLLWGVLQACPTRGSVLLAQELPQQLTSPGYPEP 6 0 

IIIIIIIIIIIIIIIIIIIIIMIIillllMllillMllilllllllllllMIIIII 

MPGPRVWGKYLWRSPHSKGCPGAMWWLLLWGVLQACPTRGSVLLAQELPQQLTSPGYPEP 6 0 



QUERY: 


1 


SB JCT : 


1 


QUERY: 


61 


SB JCT : 


61 


QUERY: 


121 


SB JCT : 


121 


QUERY: 


181 


SB JCT : 


181 


QUERY : 


241 


oQ(JL.i : 




QUERY: 


301 


SB JCT: 


301 


QUERY : 


361 


SB JCT : 


361 


QUERY: 


421 


SB JCT : 


421 


QUERY: 


481 


SBJCT : 


481 



YGKGQESSTDIKAPEGFAVRLVFQDFDLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLG 

llllllllllillllllllllllllllllllllllllllllillllllllllMlillll 

YGKGQESSTDIKAPEGFAVRLVFQDFDLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLG 



120 



120 



RPPGQREFVSSGRSLRLTFRTQPSSENKTAHLHKGFLALYQTVAVNYSQPISEASRGSEA 180 

MlllillllMIIIIIIIIIIIIIIIIIIIMMIilMIIIIIMIIIIIIIIIIIII 

RPPGQREFVSSGRSLRLTFRTQPSSENKTAHLHKGFLALYQTVAVNYSQPISEASRGSEA 180 



INAPGDNPAKVQNHCQEPYYQAAAAGALTCATPGTWKDRQDGEEVLQCMPVCGRPVTPIA 

lillliillllllillllllMIIIIIIIIIIIMIillllllllllllillllllllll 

INAPGDNPAKVQNHCQEPYYQAAAAGALTCATPGTWKDRQDGEEVLQCMPVCGRPVTPIA 



240 



240 



QNQTTLGSSRAKLGNFPWQAFTSIHGRGGGALLGDRWILTAAHTIYPKDSVSLRKNQSVN 300 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIklllllllllllllll 

QNQTTLGSSRAKLGNFPWQAFTSIHGRGGGALLGDRWILTAAHTVYPKDSVSLRKNQSVN 300 

VFLGHTAIDEMLKLGNHPVHRVWHPDYRQNESHNFSGDIALLELQHSIPLGPNVLPVCL 360 

lllllllllllllllllllllllllllllllilllllllllllllllllllllllMMI 
VFLGHTAIDEMLKLGNHPVHRVWHPDYRQNESHNFSGDIALLELQHSIPLGPNVLPVCL 360 

PDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMF 420 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIillllllllllllllllllllllllllllll 
PDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMF 420 

CVGDETQRHSVCQGDSGSLYWWDNHAHHWVATGI VSWGIGCGEGYDFYTKVLSYVDWIK 4 80 

MllllllllllllllliHIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
CVGDETQRHSVCQGDSGSVYWWDNHAHHWVATGIVSWGIGCGEGYDFYTKVLSYVDWIK 480 



Mill 



R = AT RESIDUE 
IS THE SAME. 



46, FCTR5B DIFFERS FROM FCTR5A IN THAT Q46R. THE REST OF THE HOMOLOGY 



The fiill amino acid sequence of the protein of FCTRSa has 175 of 303 amino acid 
residues (58%) identical to, and 226 of 303 residues (74%) positive with the 400-701 amino acid 
segment, 72 of 157 residues (45%) identical and 94 of 157 residues (59%) positive with amino 
acids 1-155, and 36 of 139 residues (25%) identical and 58 of 139 residues (40%) positive with 
amino acids 188-312 of the 705 amino acid Complement CIR Component Precursor from Homo 
sapiens (GenBank-ACC: AAA51851.1) (SEQ ID NO:43) (Table 5H). 
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Table 5H. BLASTPj|£FCTR5a and b against Complemeni^R Component Precursor 



(SEQ ID NO:81) 



>GI|115204|SP|P00736|C1R_HUMAN COMPLEMENT CIR COMPONENT PRECURSOR 
GI I 67614 I PIR I I CIHURB COMPLEMENT SUBCOMPONENT CIR (EC 3.4.21.41) PRECURSOR 
Gl|l79644|GB|AAA51851.l| (M14058) HUMAN COMPLEMENT CIR [HOMO SAPIENS] 
LENGTH = 70S 



HUMAN 



SCORE = 361 BITS (928), EXPECT = 8E-99 

IDENTITIES = 175/303 (58%), POSITIVES = 226/303 (74%), GAPS = 9/303 (2%) 



QUERY: 


189 


SB JCT : 


400 


QUERY: 


241 




^ D U 


QUERY: 


301 


SB JCT : 


519 


QUERY: 


361 


SB JCT : 


579 


QUERY: 


421 


SB JCT: 


639 


QUERY: 


481 


SB JCT : 


699 



AKVQNHCQEPYYQ AAAAGALTCATPGTWKDRQDGEEVLQCMPVCGRPVTPIA 24 0 

h-^l +1 lllh IN 111+111++ +I + IIII + II 1 + 

ARIQYYCHEPYYKMQTRAGSRESEQGVYTCTAQGIWKNEQKGEKIPRCLPVCGKPVNPVE 4 59 

QNQTTLGSSRAKLGNFPWQAFTSIHGRGGGALLGDRWILTAAHTIYPKDSVSLRKNQSVN 300 

II +1 +II+IIIIII ll+lllllllltMIIIIIIIIII+lll+ + + I I++ 
QRQRIIGGQKAKMGNFPWQVFTNIHGRGGGALLGDRWILTAAHTLYPKEHEA-QSNASLD 518 

VFLGHTAIDEMLKLGNHPVHRVWHPDYRQNESHNFSGDI ALLELQHS I PLGPNVLPVCL 360 

mill ++I++IIIIII+ II lllllll+IMI IIIIIIII++I+ llll+IMI 
VFLGHTNVEELMKLGNHPIRRVSVHPDYRQDESYNFEGDIALLELENSVTLGPNLLPICL 578 

PDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMF 420 

lll+l I IMI1IIII+ + +I++ Hill +11 11+ + I +111 III 
PDNDTFYDLGLMGYVSGFGVMEEKIAHDLRFVRLPVANPQACENWLRGKNRMDVFSQNMF 63 8 

CVGDETQRHSVCQGDSGSLYWWDNHAHHWVATGIVSWGIGCGEGYDFYTKVLSYVDWIK 480 

II + + mill ++ II + iiiiiiiiiiiii II iiiiii+iimi 

CAGHPSLKQDACQGDSGGVFAVRDPNTDRWVATGI VSWGIGCSRGYGFYTKVLNYVDWIK 698 



SCORE = 122 BITS (306), EXPECT = lE-26 
IDENTITIES = 72/157 (45%), POSITIVES = 94/157 (59%), GAPS = 3/157 (1%) 

R 

QUERY: 24 MWWLLLWGVLQACPTRGSVLLAQELPQQLTSPGYPEPYGKGQESSTDIKAPEGFAVRLVF 83 

II I I I 11+ + l+l ++III +I+II I++I I I 1+ l+lll 

SBJCT: 1 MWLLYLLVPALFCRAGGSIPIPQKLFGEVTSPLFPKPYPNNFETTTVITVPTGYRVKLVF 60 

QUERY : 84 QDFDLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLGRPPGQREFVSSGRSLRLTFRTQP 143 

I IIIIII+ I I I II +111111111 III++II+I I + III I 

SBJCT : 61 QQFDLEPSEGCFYDYVKISADKKSLGRFCGQLGSPLGNPPGKKEFMSQGNKMLLTFHTDF 120 

QUERY: 144 SS-ENKTAHLHKGFLALYQTVAVNYSQPISEASRGSE 179 

1+ II 1 +IIIII II 11+ + I + I I 
SBJCT: 121 SNEENGTIMFYKGFI1AYYQ--AVDLDECASRSKSGEE 155 



SCORE =36.3 BITS (83), EXPECT =0.93 
IDENTITIES = 36/139 (25%), POSITIVES = 58/139 (40%), GAPS = 17/139 (12%) 

R 

QUERY: 35 ACPTRGSVLLAQELPQQLTSPGYPEPYGKGQESSTDIKAPEGFAVRLVF-QDFDLEPSQD 93 

+1 III ++I II I + 1+ I + II + II++ I 

SBJCT: 188 SCQAECSSELYTEASGYISSLEYPRSYPPDLRCNYSIRVERGLTLHLKFLEPFDIDDHQQ 247 

QUERY : 94 - -CAGDSVTISFVGSDPSQFCGQQGSPLGRPPGQREFVSSGRSLRLTFRTQPSSENKTAH 151 

I I + I I + +111+1 III + +1 ++ I 11 I ++ 
SBJCT: 248 VHCPYDQLQIYANGKNIGEFCGKQ RPP DLDTSSNAVDLLFFTDESGDS 295 

QUERY: 152 LHKGFLALYQTVAVNYSQP 170 

+1+ 11+ II 
SBJCT: 296 - -RGWKLRYTTEIIKCPQP 312 
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R = AT RESIDUE 46, FCTR5B DIFFERS FROM FCTR5A IN THAT Q46^^rHE REST OF THE HOMOLOGY 
IS THE SAME. 

Based upon homology, FCTR5 proteins and each homologous protein or peptide may 
share at least some activity. 



FCTR6 

The novel nucleic acid of 1078 nucleotides FCTR6a (also designated 27455183.0.19) 
encoding a novel human blood coagulation factor Xl-like protein is shown in Table 6A. An 
ORF was identified beginning with an ATG initiation codon at nucleotides 243-245 and ending 
with a TAA codon at nucleotides 1044-1046. A putative untranslated region upstream from the 
initiation codon and downstream from the termination codon is underlined in Table 6A5 and the 
start and stop codons are in bold letters. 

Table 6A FCTR6a Nucleotide Sequence (SEQ ID NO:20) 

TTGATCCGTGCCAAGTGGCTTTTTGTGGGCTCTGTAGAGTGCTCTAAACCCAGCTCGGCCTTTGCTGTATTAGACAGAAGCACCTC 

ATTCATATCCCTGGGGCCCCTGATGGTGCAGTGGTCTGGCTGTGGTCTGCACACCAGCTATTCTGTTTTGTTTTGTTTTGTTTTTT 

TCCTACCTTTTTCCAATCCTCACACCTTCTGATCAACAGCCCCAGTAGGGTTTAAAGGTCCTAGAGCTACA TGGGATTTAGGTT 

TGGGCACAGCCAATTCTGCCACTTTTGAGACTTCCCTTCCCCTTCCACTTGCCCCTCTCTGGTTCTCTGCCACCAGTCCAGAAGAA 

CTGAGTGTCGTGCTGGGGACCAACGACTTAACTAGCCCATCCATGGAAATAAAGGAGGTCGCCAGCATCATTCTTCACAAAGACTT 

TAAGAGAGCCAACATGGACAATGACATTGCCTTGCTGCTGCTGGCTTCGCCCATCAAGCTCGATGACCTGAAGGTGCCCATCTGCC 

TCCCCACGCAGCCCGGCCCTGCCACATGGCGCGAATGCTGGGTGGCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACTCTGTG 

AAAACGGATCTGATGAAAGTGCCAATGGTCATCATGGACTGGGAGGAGTGTTCAAAGATGTTTCCAAAACTTACCAAAAATATGCT 

GTGTGCCGGATACAAGAATGAGAGCTATGATGCCTGCAAGGGTGACAGTGGGGGGCCTCTGGTCTGCACCCCAGAGCCTGGTGAGA 

AGTGGTACCAGGTGGGCATC7VTCAGCTGGGGAAAGAGCTGTGGAGATAAGAACACCCCAGGGATATACACCTCGTTGGT 

AACCTCTGGATCGAGAAAGTGACCCAGCTAGGAGGCAGGCCCTTCAATGCAGAGAAAAGGAGGACTTCTGTCAAACAGAAACCTAT 

GGGCTCCCCAGTCTCGGGAGTCCCAGAGCCAGGCAGCCCCAGATCCTGGCTCCTGCTCTGTCCCCTGTCCCATGTGTTGTTCAGAG 

CTATTTTGTACTGATAATAAAATAGAGGCTATTCTTTCAACCGAAA 



The FCTR6a protein encoded by SEQ ID NO:20 has 267 amino acid residues and is 
presented using the one-letter code in Table 6B. FCTR6a was searched against other databases 
using SignalPep and PSort search protocols. The FCTR6a protein is most likely mitochondrial 
matrix space (Certainty= 0.4372) and seems to have no N-terminal signal sequence. The 
predicted molecular weight of FCTR6a protein is 29412.8 daltons. 

Table 6B. Encoded FCTR6a protein sequence (SEQ ID NO:21). 

MGFRFIiGTANSATFETSLPLPU^PLWFSATSPEELSVVLGTNDLTSPSMEIKEVASIILHKDFKRANMDNDIA^ 
KVPICLPTQPGPATWRECWAGWGQTNAADKNSVKTDLMKVPMVIMDWEECSKMFPKLTKNMLCAGYKNESYDACK^ 
PEPGEKWYQVGIISWGKSCGDKNTPGIYTSLVNYNLWIEKVTQLGGRPFNAEKRRTSVKQKPMGSPVSGVPEPGSPRSWLLLCPLS 
HVLFRAILY 

In an altemative embodiment, FCTR6b (alternatively referred to as 27455183.0.145) has 
the 1334 residue sequence shown in Table 6C. An ORF was identified beginning with an ATG 
initiation codon at nucleotides 499-501 and ending with a TAA codon at nucleotides 1300-1302. 
A putative untranslated region upstream from the initiation codon and downstream from the 
termination codon is underlined in Table 6C, and the start and stop codons are in bold letters. 
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Tabl^C FCTR6b Nucleotide Sequence (SH^D NO:22) 



GATTTTAGAAGGT1 ii ii li n i ^^TrnnrT GACAGTTTCTTCATGGCATAACCACAGAqlK'TGTGGCACCCGCTGT 

CGTGGGATATCAAATATCCTCTGGGGTTCGGAATGTGGGCTTATTACTGAAGATCCTGTCTGCTTGGTCAGTGGCAG^ 
TAGACTAACTTCTGGTCCTGAGTTTCTAAAGTGCTGGTAGACCAGTTGATACAAAACAGATATAATAATGAATGCCTTAT 
5 CTATCTGAAGGTCAGTTTGATCCGTGCCAAGTGGCTTTTTGTGGGCTGTGTAGAGTGCTCTAAAC 

CTGTATTAGACAGAAGCACCTCATTCATATCCCTGGGGCCCCTGATGGTGCAGTGGTCTGGCTGTGGTCTGCAC^ 
TATTCTGTTTTGTTTTGTTTTGTTTTGTTTTTTCCTACCTTTTTCCAATCCTCACACCTTCTGATCAACA 
GGTTTAAAGGTCCTAGAGCTACA TGGGATTTAGGTTTCTGGGCACAGCCAATTCTGCCACTTTTGAGACTTCCCTTCCCC 
TTCCACTTGCCCCTCTCTGGTTCTCTGCCACCAGTCCAGAAGAACTGAGTGTCGTGCTGGGGACCAACGACTTAACTAGC 

10 CCATCCATGGAAATAAAGGAGGTCGCCAGCATCATTCTTCACAAAGACTTTAAGAGAGCCAACATGGACAATGA^ 

CTTGCTGCTGCTGGCTTCGCCCATCAAGCTCGATGACCTGAAGGTGCCCATCTGCCTCCCCACGCAGCCCGGCCCTGCC^ 
CATGGCGCGAATGCTGGGTGGCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACTCTGTGAAAACGGATC 
GTGCCAATGGTCATCATGGACTGGGAGGAGTGTTCAAAGATGTTTCCAAAACTTACCTW^TATGCTC 
CAAGAATGAGAGCTATGATGCCTGCAAGGGTGACAGTGGGGGGCCTCTGGTCTGCACCCCAGAGCCTGGTGAGAAGTGGT 

1 5 ACCAGGTGGGCATCATCAGCTGGGGAAAGAGCTGTGGAGAGAAGAACACCCCAGGGATATACACCTCGTTGGTG^ 
AACCTCTGGATCGAGAAAGTGACCCAGCTAGAGGGCAGGCCCTTCAATGCT^GAGAAAAGGAGGACTTCTGTCAAAC^ 
ACCTATGGGCTCCCCAGTCTCGGGAGTCCCAGAGCCAGGCAGCCCCAGATCCTGGCTCCTGCTCTGTCCCCTGTCCCATG 
TGTTGTTCAGAGCTATTTTGTACTG ATAATAAAATAGAGGCTATTCTTTCAACCGAAA 

20 The FCTR6b protein encoded by SEQ ID NO:22 has 267 amino acid residues and is 

presented using the one-letter code in Table 6B. The Psort profile for FCTR4 predicts that this 
sequence has no N-terminal signal peptide and is likely to be localized at the mitochondrial 

i==i matrix space (Certainty=0.4372). The predicted molecular weight of this protein is 29498.9 

^ Daltons. 

g 25 Table 6D. Encoded FCTR6b protein sequence (SEQ ID NO:23). 

MGFRFLGTANSATFETSLPLPLAPLWFSATS PEELS WLGTNDLTS PSME I KEVAS 1 1 LHKDFKRANMDND lALLLLAS P I KLDDL 
[?i KVPICLPTQPGPATWRECWAGWGQTNAADKNSVKTDLMKVPMVIMDWEECSKMFPKLTKNMLCAGYK^ 

PEPGEKWYQVGIISWGKSCGEKNTPGIYTSLVNYNLWIEKVTQLEGRPFNAEKRRTSVKQKPMGSPVSGVPEPGSPRSWLLLCPLS 
£0 HVLFRAILY 

^ 30 

h| In a search of sequence databases, it was found, for example, that the FCTR6a nucleic 

\i acid sequence has 853 of 897 bases (95 %) identical to bases 551-1447, and 346 of 388 bases 

(89%) identical to bases 127-513 of Macaca fascicularis brain cDNA, clone QccE- 17034 
(GENBANK-ID: |AB046651) (Table 6E). 

35 Table 6E. BLASTN of FCTR6a against Macaca fascicularis brain cDNA, clone QccE-17034 

(SEQ ID NO:82) 

> GI I 9651112 |DBJ|AB046651 . 1 |AB0466S1 i^lACACA FASCICULARIS BRAIN CDNA, CLONE QCCE-17034 
LENGTH = 1746 

40 SCORE = 1429 BITS (721), EXPECT = 0.0 

IDENTITIES = 853/897 (95%) 
STRAND = PLUS / PLUS 

^UERY: 434 CCTTTTTCC7! 

45 I Ml II II MM I MM II II II III I 1 IIMIIIIIIIIIIII MM II I MM I 

CCTTTTTCCAATCCTCACACCTTCTGAGCTACAGCCCCAGTAGGGTCTAAATGTCCTAGA 
SCTACATGGGATTTAGGTTTCTGGGCACAGCCAATTCTGCCACTTTTGAGACTTCCCTTC 

MM III MMMMMMM MMMMMMM lllllllllll lllllllll 

QUERY: 554 i 

IIIM MM llllllllllllllllllllllllllllllllllllllll llllllll 



QUERY : 


434 


SB JOT : 


551 


QUERY: 


4 94 


SBJCT: 


611 


QUERY: 


554 
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SB JCT : 
QUERY: 
SB JCT : 
QUERY : 
SB JCT : 
QUERY : 
SB JCT : 
QUERY: 
SB JCT : 



671 
614 
731 
674 
791 
734 
851 
794 
911 



CCCTTTCACTCGCCCCTCTCTGGTTCTCTGCCACCAGTCCAGAAa 



::actcgci 
:ci^Plv< 



sWRcAGi 



.TGTCGTGC 730 



TGGGGACa^pS^CTTAACTAGCCCATCCATGGAAATAAAGGAGGTWCCAGCATCATTC 

lllllllillllllllllllill IIIIIIIIIMIIIMIIIIIIIIIIIIIIMIMI 

TGGGGACCAACGACTTAACTAGCTCATCCATGGAAATAAAGGAGGTCGCCAGCATCATTC 



673 



790 



TTCACAAAGACTTTAAGAGAGCCAACATGGACAATGACATTGCCTTGCTGCTGCTGGCTT 733 

IIIIIM IIIIIIIIIIIIIIMIIIMIIMMIIIMIIIIIIIIIIIIIIIIM I 

TTCACAAGGACTTTAAGAGAGCCAACATGGACAATGACATTGCCTTGCTGCTGCTGGCCT 850 
CGCCCATCAAGCTCGATGACCTGAAGGTGCCCATCTGCCTCCCCACGCAGCCCGGCCCTG 793 

MIIIMII IIIIIIIIIIIIIIIMIIIIIIIMIIIIM IIIIIM llllll I 

CGCCCATCACACTCGATGACCTGAAGGTGCCCATCTGCCTCCCTACGCAGCACGGCCCCG 910 

CCACATGGCGCGAATGCTGGGTGGCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACT 853 

IIIMMII llllllllllllllllllllllllllllllllllllllllllllllllll 
CCACATGGCACGAATGCTGGGTGGCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACT 970 



QUERY: 854 CTGTGAAAACGGATCTGATGAAAGTGCCAATGGTCATCATGGACTGGGAGGAGTGTTCAA 913 

llllllllllllllllllllllll III IIIIIIIIIIIIIIIIIMIIIIIIIIIIII 
SB JCT: 971 CTGTGAAAACGGATCTGATGAAAGCGCCGATGGTCATCATGGACTGGGAGGAGTGTTCAA 1030 

QUERY: 914 AGATGTTTCCAAAACTTACCAAAAATATGCTGTGTGCCGGATACAAGAATGAGAGCTATG 973 

II lllllllillll IIIMIIIIIIIIIIIIIII llllllll lllllllllllll 
SB JCT: 1031 AGGCGTTTCCAAAACTCACCAAAAATATGCTGTGTGCTGGATACAATAATGAGAGCTATG 1090 

QUERY: 974 ATGCCTGCAAGGGTGACAGTGGGGGGCCTCTGGTCTGCACCCCAGAGCCTGGTGAGAAGT 1033 

I llllll llllllllll Mil! Illlllllllllllllllllllllllllllllll 
SB JCT: 1091 ACGCCTGCCAGGGTGACAGCGGGGGACCTCTGGTCTGCACCCCAGAGCCTGGTGAGAAGT 1150 

QUERY: 1034 GGTACCAGGTGGGCATCATCAGCTGGGGAAAGAGCTGTGGAGAGAAGAACACCCCAGGGA 1093 

lllllllllllll MIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
SB JCT: 1151 GGTACCAGGTGGGTATCATCAGCTGGGGAAAGAGCTGTGGAGAGAAGAACACCCCAGGGA 1210 

QUERY: 10 94 TATACACCTCGTTGGTGAACTACAACCTCTGGATCGAGAAAGTGACCCAGCTAGAGGGCA 1153 

llllllllllllllllllllllllllllllllllllllll IIIIIIIIIIIIIMIIII 
SBJCT: 1211 TATACACCTCGTTGGTGAACTACAACCTCTGGATCGAGAAGGTGACCCAGCTAGAGGGCA 1270 
QUERY: 1154 GGCCCTTCAATGCAGAGAAAAGGAGGACTTCTGTCAAACAGAAACCTATGGGCTCCCCAG 1213 

lllllllll III IIIIIM llllll IIIIIIIIIIMMMIIIIMIIIIII M 

SBJCT: 1271 GGCCCTTCAGTGCGGAGAAAATGAGGACCTCTGTCAAACAGAAACCTATGGGCTCCCGAG 1330 
QUERY: 1214 TCTCGGGAGTCCCAGAGCCAGGCAGCCCCAGATCCTGGCTCCTGCTCTGTCCCCTGTCCC 1273 

IIIIIM IIIIIIIIIMIIII III IIIIIIIIIIMIIIIIIIIIMIIIIIIIII 

SBJCT: 1331 TCTCGGGGGTCCCAGAGCCAGGCGGCCTCAGATCCTGGCTCCTGCTCTGTCCCCTGTCCC 13 90 

QUERY: 1274 ATGTGTTGTTCAGAGCTATTTTGTACTGATAATAAAATAGAGGCTATTCTTTCAACC 1330 

IIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII III MM 
SBJCT : 1391 ATGTGTTGTTCAGAGCTATTTTGTACTGATAATAAAATAGAGGCTATTTTTTTAACC 144 7 

SCORE = 428 BITS (216), EXPECT = E-117 
IDENTITIES = 346/388 (89%), GAPS 1/388 (0%) 
STRAND = PLUS / PLUS 

QUERY: 1 GATTTTAGAAGGTTAATCAAAAACCCGGGGACAGTTTCTTCATGGCATAACCACAGACCT 60 

MIIIIIIMIIIIIMIIIIIIIM IIMIIIIII Mill llllllll Mill 

SBJCT: 127 GATTTTAGAAGGTTAATCAAAAACCCAAGGACAGTTTCATCATGTCATAACCAAAGACCC 186 

QUERY: 61 TTGTGGCACCCGCTGTCGTGGGATATCAAATATCCTCTGGGGTTCGGAATGTGGGCTTAT 120 

IIIIIIMII llllll lllllll llllllll I IIIMIII llllllll Mill 
SBJCT: 187 TTGTGGCACCTGCTGTCATGGGATAACAAATATCTTGTGGGGTTCTGAATGTGGACTTAT 246 



QUERY: 121 TACTGAAGATCCTGTCTGCTTGGTCAGTGGCAGGTCTAGACTAACTTCTGGTCCTGAGTT 

IIIMIII IlillllillillMIIIIII IIIIIIIIIIIIMMIIIIIIIIII I 

SBJCT: 247 TACTGAAGCTCCTGTCTGCTTGGTCAGTGG-TGGTCTAGACTAACTTCTGGTCCTGAGAT 



180 



305 



QUERY: 181 TCTAAAGTGCTGGTAGACCAGTTGATACAAAACAGATATAATAATGAATGCCTTATCTAT 240 

lllllllll lllllllll Mill I MM I IIIIIMIIIIIIIIIIIII MM 

SBJCT: 306 TCTAAAGTGTTGGTAGACCGGTTGAGATAAAAGATATATAATAATGAATGCCTTACCTAT 365 



QUERY: 241 CTGAAGGTCAGTTTGATCCGTGCCAAGTGGCTTTTTGTGGGCTGTGTAGAGTGCTCTAAA 
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mil iiujmmiijmii iiiiiiiiiiiiiii iiiiuiiii iiiii 

SBJCT: 366 CTGAAAAC C^MfGATCCGTGCCAAGGGGCTTTTTGTGGGCTCTC^^pGTGCCCTAAA 425 

QUERY: 3 01 CCCAGCTCGGCCTTTGCTGTATTAGACAGAAGCACCTCATTCATATCCCTGGGGCCCCTG 360 

I Mill II lllllllllll MMI IIIIIMM MUM III I II IIIII II 

SBJCT: 426 CCCAGCTCTGCCTTTGCTGTGTTAGACAGAAGCACGCCATTCACATCTCTGGGGCCCCCA 485 
QUERY: 361 ATGGTGCAGTGGTCTGGCTGTGGTCTGC 388 

Illllll MM III IIIIMIIII 

SBJCT: 4 86 ATGGTGCCATGGTGTGGTTGTGGTCTGC 513 

In a search of sequence databases, it was found, for example, that the FCTR6a nucleic 
acid sequence has 295 of 378 bases (78 %) identical to bases 410-779 of Mus musculus adult 
male testis cDNA, RIKEN full-length enriched (GENBANK-ID:AK09660) (Table 6F). 

Table 6F. BLASTN of FCTR6a against Mus musculus adult male testis cDNA, RIKEN full- 
length enriched (SEQ ID NO:83) 

> GI| 1285542 9|DBJ|AK016601.1|AK016601 MUS MUSCULUS ADULT MALE TESTIS CDNA, RIKEN FULL- 
LENGTH ENRICHED 

LIBRARY, CLONE :4933401F05, FULL INSERT SEQUENCE 
LENGTH = 104 7 

SCORE = 97.6 BITS (49), EXPECT = 2E-17 
IDENTITIES = 295/378 (78%), GAPS = 8/378 (2%) 
STRAND = PLUS / PLUS 

AACATGGACAATGACATTGCCTTGCTGCTGCTGGCTTCGCCCATCAAGCTCGATGACCTG 756 

lllllllllll llllllllllll Illllll II MM I I I II MM III 

AACATGGACAACGACATTGCCTTGTTGCTGCTAGCCAAGCCCTTGACGTTCAATGAGCTG 469 
AAGGTGCCCATCTGCCTCCCCACGCAGCCCGGCCCTGCCACATGGCGCGAATGCTGGGTG 816 

I IIIIIIMIIIIIII II IIIII Mil III MM lllllllllllll 

ACGGTGCCCATCTGCCTTCCTCTCTGGCCCGCCCCTCCCAGCTGGCACGAATGCTGGGTG 529 
GCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACTCTGTGAAAACGGATCTGATGAAA 876 

IIIII llllll Mill I Illllll I III II IIIIIIIMIIIIII 

GCAGGATGGGGCGTAACCAACTCAACTGACAAGGAATCTATGTCAACGGATCTGATGAAG 589 
GTGCCAATGGTCATCATGGACTGGGAGGAGTGTTCAAAGATGTTTCCAAAACTTACCAAA 936 

IIIII III IIIII 11 llllllll II I I IMIIIIIII I II MM I 

GTGCCCATGCGTATCATAGAGTGGGAGGAATGCTTACAGATGTTTCCCAGCCTCACCACA 64 9 

AATATGCTGTGTGCCGGATACAAGAATGAGAGCTATGATGCCTGCAAGGGTGACAGTGGG 996 

II llllllllllll III lllllllllll IIIII III Illllll 
AACATGCTGTGTGCCTCATATGGTAATGAGAGCTACGATGCTTGC CAGTGGG 701 

GGGCCTCTGGTCTGCACCCCAGAGCCTGGTGAGAAGTGGTACCAGGTGGGCATCATCAGC 1056 

II II II lllllllll MM IIIII I illllllllllllllllllllllll 

GGACCGCTTGTCTGCACCACAGATCCTGGCAGTAGGTGGTACCAGGTGGGCATCATCAGC 761 

TGGGGAAAGAGCTGTGGA 1074 

IIIII llllllllllll 
TGGGGCAAGAGCTGTGGA 779 

The FCTR6a amino acid has 247 of 267 amino acid residues (92%) identical to, and 251 
of 307 residues (94%) positive with, the 267 amino acid hypothetical protein [Macaca 
fascicularis] (GenBank: AB046651) (SEQ ID NO:84) (Table 6G). 



QUERY : 


697 


SBJCT : 


410 


QUERY: 


757 


SBJCT: 


470 


QUERY : 


817 


SBJCT : 


530 


QUERY : 


877 


SBJCT : 


590 


QUERY : 


937 


SBJCT : 


650 


QUERY : 


997 


SBJCT : 


702 


QUERY: 


105 


SBJCT: 


762 
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QUERY: 


1 


SBJCT: 


1 


QUERY: 


61 


SBJCT : 


61 


QUERY: 


121 


SBJCT : 


121 


QUERY : 


181 


SBJCT : 


181 


QUERY: 


241 


SBJCT : 


241 



Table 6G. BLASTP j^CTR6a and b against hypothetica|^|tein [Macaca fascicularis] 

(SEQ ID NO:84) 

> GI| 9651113 |DBJ|BAB03569.1| (AB046651) HYPOTHETICAL PROTEIN (MACACA FASCICULARIS] 
LENGTH =267 

SCORE = 467 BITS (1202), EXPECT = E-131 
IDENTITIES = 247/267 (92%), POSITIVES = 251/267 (94%) 

MGFRFLGTANSATFETSLPLPLAPLWFSATSPEELSWLGTNDLTSPSMEIKEVASIILH 60 

I MM IIII III IIII IMMIIIIIIIII^MIMIIIII IIIIIIIIIIIII 

MRFRFLSTANSPTFEASLPLSLAPLWFSATSPEELNWLGTNDLTSSSMEIKEVASIILH 60 
KDFKRANMDNDIALLLLAS P I KLDDLKVP I CLPTQPGPATWRECWVAGWGQTNAADKNS V 120 

iiiiiiiiiiiiiiiiMiii IIIIIIIIIIIII nil! iiiiiiiiiiiiiiiiii 

KDFKRANMDND I ALLLLAS P I TLDDLKVP I CLPTQHGPATWHE CWVAGWGQTNAADKNS V 120 
KTDLMKVPMVIMDWEECSKMFPKLTKNMLCAGYKNESYDACKGDSGGPLVCTPEPGEKWY 180 

lillll IIIIIIIIIIM IIIIMIIIIIII llllilkllllllllllllllllll 

KTDLMKAPMVIMDWEECSKAFPKLTKNMLCAGYNNESYDACQGDSGGPLVCTPEPGEKWY 180 

K E 
QVGIISWGKSCGDKNTPGIYTSLVNYNLWIEKVTQLGGRPFNAEKRRTSVKQKPMGSPVS 240 

llllllillllkllllllllllllllllllllill lllklll lllllllllll II 



lllllll lllllllllllllllllll 

GVPEPGGLRSWLLLCPLSHVLFRAILY 267 
K AND E ARE RESIDUES THAT DIFFER BETWEEN FCTR6A AND B. D193K, AND G217E. 

The FCTR6a amino acid has 80 of 201 amino acid residues (39%) identical to, and 1 19 
of 201 residues (58%) positive with, the 638 amino acid plasma kallikrein Bl precursor 
(GENBANK-ID:NP_000883.1) (SEQ ID NO:85) (Table 6H). 

Table 6H. BLAST? of FCTR6a and b against plasma kallikrein Bl precursor (SEQ ID 

NO:85) 

>GI|4504877|REF|NP_000883.1| PLAS^4A KALLIKREIN Bl PRECURSOR; KALLIKREIN, PLASMA; 
KALLIKREIN B 

PLASMA; KALLIKREIN 3, PLASMA; FLETCHER FACTOR [HOMO 
SAPIENS] 

GI|125184|SP|P03952|KAL HUMAN PLASMA KALLIKREIN PRECURSOR (PLASMA PRE KALLIKREIN) 
(KININOGENIN) 

{FLETCHER FACTOR) 

GI|67591|PIR| IKQHUP PLASMA KALLIKREIN (EC 3.4.21.34) PRECURSOR - HUMAN 
GI| 190263 |GB|AAA601S3 .1 I (M13143) PLASMA PREKALLIKREIN [HOMO SAPIENS] 
GI I 8809781 lGB|AAF7994a . 1 1 (AF232742) PLASMA KALLIKREIN PRECURSOR [HOMO SAPIENS] 
LENGTH = 638 

SCORE = 133 BITS (334), EXPECT = 3E-30 

IDENTITIES = 80/201 (39%), POSITIVES = 119/201 (58%), GAPS = 18/201 (8%) 
QUERY: 20 LPLAPLWFSATSPEELSWLGTNDLT- -SPSMEIKEVASIILHKDFKRANMDNDIALLLL 77 

Ml H I ^1 +hl H HII IM+^-^l + +HIII+ I 

SBJCT: 439 LPLQDVW RIYSGILNLSDITKDTPFSQIKE- - -IIIHQNYKVSEGNHDIALIKL 489 

QUERY: 78 ASPIKLDDLKVPICLPTQPGPAT-WRECWVAGWGQTNAADKNSVKTDLMKVPMVIMDWEE 136 

^ llllk^ -^1^111111^ ^1 I II + II 

SBJCT: 490 QAPLNYTEFQKPICLPSKGDTSTIYTNCWVTGWGFSK- -EKGEIQNILQKVNIPLVTNEE 547 

K 

QUERY: 137 CSKMFP- -KLTKNMLCAGYKNESYDACKGDSGGPLVCTPEPGEKWYQVGI ISWGKSCGDK 194 
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I I ^ l^l^lllll IIIMIIIMIII ^ I ^ Ilk I ^ 

SBJCT: 54 8 CQKRYQDYKj^WVCAGYKEGGKDACKGDSGGPLVC- -KHNGMWF^MTSWGEGCARR 605 



QUERY: 195 NTPGIYTSLVNYNLWIEKVTQ 215 

Ihll +111 + 11 
SBJCT: 606 EQPGVYTiCVAEYMDWILEKTQ 626 

K IS A RESIDUE THAT DIFFERS BETWEEN FCTR6A AND B. D193K. 

The FCTR6a amino acid has 73 of 183 amino acid residues (39%) identical to, and 1 10 
of 183 residues (59%) positive with, the 643 amino acid kallikrein [Sus scrofa] (GENBANK- 
ID:BAA37147.1) (SEQ ID NO:86) (Table 61). 

Table 61. BLASTP of FCTR6a and b against kallikrein [Sus scrofa] (SEQ ID NO:86) 

> GI I 41653 15 I DBJ | BAA37147 . 1 1 (AB022425) KALLIKREIN [SUS SCROFA] 
LENGTH =643 

SCORE = 128 BITS (322), EXPECT = 9E-29 

IDENTITIES = 73/183 (39%), POSITIVES = 110/183 (59%), GAPS = 12/183 (6%) 

QUERY: 38 VLGTNDLT- -SPSMEIKEVASIILHKDFKRANMDNDIALLLLASPIKLDDLKVPICLPTQ 95 

+ 1 +1 ++II M + I+ + +I +IIIII 1+1+ I + II1II++ 

SBJCT: 459 ILNISEITKETPFSQVKE I IIHQNYKILESGHDIALLKLETPLNYTDFQKPICLPSR 515 

QUERY: 96 PGP-ATWRECWVAGWGQTNAADKNSVKTDLMKVPMVIMDWEECSKMFP- -KLTKNMLCAG 152 

+ llllll I H ++ III + ++ III I + I++I l+lll 
SBJCT: 516 DDTNWYTNCWVTGWGFTE- -EKGEIQNILQKVNIPLVSNEECQKSYRDHKISKQMICAG 573 

K 

QUERY: 153 YKNESYDACKGDSGGPLVCTPEPGEKWYQVGI ISWGKSCGDKNTPGIYTSLVNYNLWIEK 212 

II lllll+lllllll + 1+ II 111+ t + ll+ll ++ I 11+ 

SBJCT: 574 YKEGGKDACKGESGGPLVC- -KYNGIWHLVGTTSWGEGCARREQPGVYTKVIEYMDWILE 631 

QUERY: 213 VTQ 215 
II 

SBJCT: 63 2 KTQ 634 

K IS A RESIDUE THAT DIFFERS BETWEEN FCTR6A AND B. D193K. 

The FCTR6a amino acid has 81 of 205 amino acid residues (39%) identical to, and 1 12 
of 205 residues (54%) positive with, the 625 amino acid Coagulation factor XI [Homo sapiens] 
(embCAA64368.1) (SEQ ID NO:87) (Table 6J). 

Table 6 J. BLASTP of FCTR6a and b against Coagulation factor XI [Homo sapiens] (SEQ 

ID NO:87) 

> GI| 1803 52 |GB|AAA51985.1 I {M20218) COAGULATION FACTOR XI [HOMO SAPIENS] 
LENGTH = 625 

SCORE = 127 BITS (320), EXPECT = lE-28 

IDENTITIES - 81/205 (39%), POSITIVES = 112/205 (54%), GAPS = 17/205 (8%) 

QUERY: 20 LPLAPLWFSATSPEELSWLGTNDLTSPSMEIKE VASIILHKDFKRANMDNDIA 73 

I I ++ 11+ I I I + + MM I ll+l +1 I III 
SBJCT: 427 LTAAHCFYGVESPKILRVYSGILNQS EIKEDTSFFGVQEIIIHDQYKMAESGYDIA 482 

QUERY: 74 LLLLASPIKLDDLKVPICLPTQPG-PATWRECWVAGWGQTNAADKNSVKTDLMKVPMVIM 132 

II I + + I + IIIII++ + +111 III II I I + 
SBJCT: 483 LLKLETTVNYTDSQRPICLPSKGDRNVIYTDCWVTGWGYRKLRDK- - IQNTLQKAKIPLV 540 



QUERY: 133 DWEE CS KM FP -- KLTKNMLCAG YKNESYDACKGDSGG PLVCTPEPGEKWYQVG I I SWGKS 190 
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Ill I 1-^1111+ lllllllllll I ^ ^^111 llh 

SBJCT: 541 TNEECQKRYB^MTHKMICAGYREGGKDACKGDSGGPLSC- -KHr^^BLVGITSWCEG 598 

QUERY: 191 CGDKNTPGIYTSLVNYNLWIEKVTQ 215 

I ^ IMI-^+l I II ^ II 
SBJCT: 599 CAQRERPGVYTNWEYVDWILEKTQ 623 

K IS A RESIDUE THAT DIFFERS BETWEEN FCTR6A AND B. D193K. 

The number of new cases of renal cell carcinoma in the United States in 1996 was 
projected to be 30,600 with an estimated 12,000 deaths. Tumors with a proposed histogenesis 
from the proximal tubule (clear-cell and chromophilic tumors) amount to 85% of renal cancers, 
whereas tumors with a proposed histogenesis from the connecting tubule/collecting duct 
(chromophobic-, oncocytic-, and duct Bellini-type tumors) amount to only 1 1%. 

Adenocarcinomas may be separated into clear cell and granular cell carcinomas, although 
the 2 cell types may occur together in some tumors. The distinction between well-differentiated 
renal carcinomas and renal adenomas can be difficult. The diagnosis is usually made arbitrarily 
on the basis of size of the mass, but size alone should not influence the treatment approach, since 
metastases can occur with lesions as small as 0.5 centimeters. 

While radical nephrectomy with regional lymphadenectomy, is the accepted, often 
curative therapy for stage I (localized disease) renal cell cancer, very little therapy is available 
for advance disease that represent about 70% of the patients. Radiotherapy as a postoperative 
adjuvant has not been effective, and when used preoperatively, may decrease local recurrence 
but does not appear to improve 5-yr survival. A chemotherapeutic agent capable of significantly 
altering the course of metastastic renal cell carcinoma has not been identified. (Renal Cell 
Cancer (PDQ®) Treatment - Health Professionals, Cancemet, NCI) 

There is therefore a need to identify genes that are differentially modulated in renal-cell 
carcinomas. In addition there is a need for methods to assay candidate therapeutic substances for 
modulating expression of these genes. These substances might be recombinant protein expressed 
by the identified genes or antibodies that bind to the identified proteins. There is yet additionally 
a need for an effective method of identifying target molecules or related components. These and 
related needs and defects are addressed in the present invention. 



Novel kallikrein-like/coagulation factor Xl-like Proteins and Nucleic Acids Encoding Same 

FCTR6 is surprisingly found to be differentially expressed in clear cell Renal cell 
carcinoma tissues vs the normal adjacent kidney tissues. The present invention discloses a novel 
protein encoded by a cDNA and/or by genomic DNA and proteins similar to it, namely, new 
proteins bearing sequence similarity to kallikrein-like, nucleic acids that encode these proteins or 
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fragments thereof, and ^^bodies that bind immunospecifically t^^rotein of the invention. It 
may have use as a therljBRic agent in the treatmentof renal can^^nd liver cirrhosis. 



The utility of kallikrein family members in protein therapy of Renal cancer 

5 The treatment of renal cell carcinoma with recombinant kallikrein could improve disease 

outcome through several potential mechanisms. The literature suggests that members of this 
protein family are inhibitory to the process of angiogenesis, a process of vital importance to 
tumor progression. Renal cell carcinoma is knovm to be a highly angiogenic cancer. Thus, 
treatment of renal cell carcinoma with kallikrein may effectively shutdovm the active recruitment 

10 of a blood supply to a tumor. Members of this protein family are known to play a role in 

vascular coagulation. Similar to anti-angiogenic therapy, a factor produced by cancer cells that 
is pro-coagulatory may also act to inhibit cancer grov^ by effectively "clogging" the tumor 
vascular supply. In addition, through its proteolytic activity, kallikrein may degrade ECM 
proteins or growth factors necessary for the progressive growth of cancer cells. Following is a 

1 5 relevant reference underlining the importance of Kallikrein in cancer therapy. 



The New Human Kallikrein Gene Family: Implications in Carcinogenesis. 

Diamandis EP; Yousef GM; Luo I; Magklara I; Obiezu CV 

Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Toronto, 
20 Ontario, Canada. 

Trends Endocrinol Metab 2000 Mar;l l(2):54-60. 

ABSTRACT: The traditional human kallikrein gene family consists of three genes, 
namely KLKl [encoding human kallikrein 1 (hKl) or pancreatic/renal kallikrein], KLK2 
(encoding hK2, previously knovm as human glandular kallikrein 1) and KLK3 [encoding hK3 or 

25 prostate-specific antigen (PSA)]. KLK2 and KLK3 have important applications in prostate 
cancer diagnostics and, more recently, in breast cancer diagnostics. During 

the past two to three years, new putative members of the human kallikrein gene family 
have been identified, including the PRSSLl gene [encoding normal epithelial cell-specific 1 
gene (NESl)], the gene encoding zyme/protease M/neurosin, the gene encoding prostase/KLK- 

30 LI, and the genes encoding neuropsin, stratum comeum chymotryptic enzyme and trypsin-Hke 
serine protease. Another five putative kallikrein genes, provisionally named KLK-L2, KLK-L3, 
KLK-L4, KLK-L5 and KLK-L6, have also been identified. Many of the newly identified 
kallikrein-like genes are regulated by steroid hormones, and a few kallikreins (NESl, protease 
M, PSA) are known to be downregulated in breast and possibly other cancers. NESl appears to 
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be a novel breast cancej^mor suppressor protein and PSA a pol^^nhibitor of angiogenesis. 
This brief review sunuHEes recent developments and possible a^lications of the newly 
defined and expanded human kallikrein gene locus. 



5 The utility of kallikrein-like/coagulation factor Xl-iike family members in protein 

therapy of liver cirrosis 

Results related to inflammation shown below in Example A, Table CC3, panel 4, indicate 
over-expression of 27455 1 83.0.19 in the liver cirrhosis sample, as compared to panel 1 data 
(Table CCl), where there is little or no expression in normal aduh liver. Panel 4 was generated 
10 from various human cell lines that were untreated or resting as well as the same cells that were 
treated with a wide variety of inrmiune modulatory molecules. There are several disease tissues 
represented as well as organ controls. 

Potential Role(s) of FCTR6 in Inflammation: 

15 Liver cirrhosis occurs in patients with hepatitis C and also in alcoholics. This protein is 

41% related to coagulation factor XI and its potential role in liver cirrhosis may be related to 

cleavage of kininogen. A reference for this follows: 

Thromb Haemost 2000 May;83(5):709-14 High molecular weight kininogen is cleaved 

by FXIa at three sites: Arg409-Arg410, Lys502-Thr503 and Lys325-Lys326. Mauron T, Lammle 
20 B, Wuillemin WA Central Hematology Laboratory, University of Bern, Inselspital, Switzerland. 

Abstract: 

We investigated the cleavage of high molecular weight kininogen (HK) by activated 
coagulation factor XI (FXIa) in vitro. Incubation of HK with FXIa resulted in the generation of 
cleavage products which were subjected to SDS-Page and analyzed by silverstaining, ligand- 

25 blotting and immunoblotting, respectively. Upon incubation with FXIa, bands were generated at 
1 1 1, 100, 88 kDa on nonreduced and at 76, 62 and 51 kDa on reduced gels. Amino acid sequence 
analysis of the reaction mixtures revealed three cleavage sites at Arg409-Arg410, at Lys502- 
Thr503 and at Lys325-Lys326. Analysis of HK-samples incubated with FXIa for 3 min, 10 min 
and 120 min indicated HK to be cleaved first at Arg409-Arg410, followed by cleavage at 

30 Lys502-Thr503 and then at Lys325-Lys326. In conclusion, HK is cleaved by FXIa at three sites. 
Cleavage of HK by FXIa results in the loss of the surface binding site of HK, which may 
constitute a mechanism of inactivation of HK and of control of contact system activation. 

Impact of Therapeutic Targeting of FCTR6 in Inflammation: 
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Therapeutic ta^^ig of FCTR6 with a monoclonal antib^L is anticipated to limit or 
block the extent of brelBown of kininogen and thereby reduce t^^egradation of liver that 
occurs in liver cirrhosis. A pertinent reference is: 

Thromb Haemost 1999 Nov;82(5): 1428-32 Parallel reduction of plasma levels of high and low 
molecular weight kininogen in patients with cirrhosis. 

Cugno M, Scott CF, Salerno F, Lorenzano E, Muller-Esterl W, Agostoni A, Colman RW 
Department of Intemal Medicine, IRCCS Maggiore Hospital, University of Milan, Italy, 
massimo . cugno@unimi . it 
Abstract: 

Little is known about the regulation of high-molecular- weight-kininogen (HK) and low- 
molecular- weight-kininogen (LK) or the relationship of each to the degree of liver function 
impairment in patients with cirrhosis. In this study, we evaluated HK and LK quantitatively by a 
recently described particle concentration fluorescence immunoassay (PCFIA) and qualitatively 
by SDS PAGE and immunoblotting analyses in plasma from 33 patients with cirrhosis 
presenting various degrees of impairment of liver function. Thirty-three healthy subjects served 
as normal controls. Patients with cirrhosis had significantly lower plasma levels of HK (median 
49 microg/ml [range 22-99 microg/ml]) and LK (58 microg/ml [15-100 microg/ml]) than normal 
subjects (HK 83 microg/ml [65-1 15 microg/ml]; LK 80 microg/ml [45-120 microg/ml]) 
(p<0.0001). The plasma concentrations of HK and LK were directly related to plasma levels of 
cholinesterase (P<0.0001) and albumin (P<0.0001 and P<0.001) and inversely to the Child-Pugh 
score (P<0.0001) and to prothrombin time ratio (P<0.0001) (reflecting the clinical and laboratory 
abnormalities in liver disease). Similar to normal individuals, in patients wdth cirrhosis, plasma 
HK and LK levels paralleled one another, suggesting that a coordinate regulation of those 
proteins persists in liver disease. SDS PAGE and immunoblotting analyses of kininogens in 
cirrhotic plasma showed a pattern similar to that observed in normal controls for LK (a single 
band at 66 kDa) with some lower molecular weight forms noted in cirrhotic plasma. A slight 
increase of cleavage of HK (a major band at 130 kDa and a faint but increased band at 107 kDa) 
was evident. The increased cleavage of HK was confirmed by the lower cleaved kininogen index 
(CKI), as compared to normal controls. These data suggest a defect in hepatic synthesis as well 
as increased destructive cleavage of both kininogens in plasma from patients with cirrhosis. The 
decrease of important regulatory proteins like kininogens may contribute to the imbalance in 
coagulation and fibrinolytic systems, which frequently occurs in cirrhotic patients. 

In sunmiary, the differential expression of FCTR6 (Kallikrein family) in renal cell 
carcinoma is an important finding that could have inmiense potential in renal carcinogenesis. In 
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ates its anticipated use as an 



immunotherapeutic targBr 
FCTR7 

The novel nucleic acid of 1498 nucleotides FCTR7 (also designated. 32592466.0.64) 
encoding a novel trypsin inhibitor-like protein is shovra in Table 7A. An ORF begins with an 
ATG initiation codon at nucleotides 470-472 and ends v^th a TAA codon at nucleotides 1369- 
1371. Putative untranslated regions, if any, are found upstream from the initiation codon and 
downstream from the termination codon. 



AGGCGCCTGGTTCTGCGCGTACTGGCTGTACGGAGCAGGAGCAAGAGGTCGCCGCCAGCCTCCGCCGCCGAGCCTCGTTCGTGTCC 

CCGCCCCTCGCTCCTGCAGCTACTGCTCAGAAACGCTGGGGCGCCCACCCTGGCAGACTAACGAAGCAGCTCCCTTCCCACCCCAA 

CTGCAGGTCTAATTTTGGACGCTTTGCCTGCCATTTCTTCCAGGTTGAGGGAGCCGCAGAGGCGGAGGCTCGCGTATTCCTGCAGT 

CAGCACCCACGTCGCCCCCGGACGCTCGGTGCTCAGGCCCTTCGCGAGCGGGGCTCTCCGTCTGCGGTCCCTTGTGAAGGCTCTGG 

GCGGCTGCAGAGGCCGGCCGTCCGGTTTGGCTCACCTCTCCCAGGAAACTTCACACTGGAGAGCCAAAAGGAGTGGAAGAGCCTGT 

CTTGGAGATTTTCCTGGGGAAATCCTGAGGTCATTCATT ATGAAGTGTACCGCGCGGGAGTGGCTCAGAGTAACCACAGTGCTGT^ 

CATGGCTAGAGCAATTCCAGCCATGGTGGTTCCCAATGCCACTTTATTGGAGAAACTTTTGGAAAAATACATGGATGAGG^ 

AGTGGTGGATAGCCAAACAACGAGGGAAAAGGGCCATCACAGACAATGACATGCAGAGTATTTTGGACCTTCATAATAAATTAC^^ 

AGTCAGGTGTATCCAACAGCCTCTAATATGGAGTATATGACATGGGATGTAGAGCTGGAAAGATCTGCAGAATCCAGGGCTGAAAT 

TGCTTGTGGGAACATGGACCTGCAAGCTTGCTTCCATCAATTGGACTVGAATTTGGGAGCACACTGGGGAAGATATA^ 

GTTTCT^TGTACAATCGTGGTATGATGAAGTGAAAGACTTTAGCTACCCATATGAACATGAATGCAACCCATATTGT^ 

GTTCTGGCCCTGTATGTACACATTATACACAGGTCGTGTGGGCAACTAGTAACAGAATCGGTTGTGCCATTAATTTGTGTCATAAC 

ATGAACATCTGGGGGCAGATATGGCCCAAAGCTGTCTACCTGGTGTGCAATTACTCCCCAAAGGGAAACTGGTGGGGC^ 

TTACAAACATGGGCGGCCCTGTTCTGCTTGCCCACCTAGTTTTGGAGGGGGCTGTAGAGAAAATCTGTGCTACAAAGAAGGGTCAG 

ACAGGTATTATCCCCCTCGAGAAGAGGAAACAAATGAAATAGAACGGCAGCAGTCACAAGTCCT^TGACAC 

TCAGATGATAGTAGCAGAAATGAAGTCATTAGCTTTGGGAAAAGTAATGAAAATATAATGGTTTTAGAAATCCTGTGTTAAM^^ 
GCTATATTTTCTTAGCAGTTATTTCTACAGTTAATTACATAGTCATGATTGTTCTACGTTTCATATATTATATGGTGCTTTGTATA 
TGCCCCTAATAAAATGAATCTAAACATTGAAAAAAA 

The FCTR7 protein encoded by SEQ ID NO:24 has 300 amino acid residues and is 
presented using the one-letter code in Table 7B. The FCTR7 gene was found to be expressed in: 
brain; germ cell tumors. FCTR7 gene maps to Unigene cluster Hs. 182364 which is expressed in 
the following tissues: brain, breast, ear, germ cell, heart, liver, lung, whole embryo, ovary, 
pancreas, pooled, prostate, stomach, testis, uterus, vascular. Therefore the FCTR7 protein 
described in this invention is also expressed in the above tissues. 

The SignalP, Psort and/or Hydropathy profile for FCTR7 predict that this sequence has a 
signal peptide and is likely to be localized outside of the cell with a certainty of 0.4228. The 
SignalP shows a cleavage site between amino acids 20 and 21, i.e., at the dash in the sequence 
amino acid ARA-IP. The predicted molecular weight of FCTR7 is 34739.9 Daltons. Hydropathy 
profile shows an amino terminal hydrophobic region. This region could ftmction as a signal 
peptide and target the invention to be secreted or plasma membrane localized. 



Table 7A. FCTR7 Nucleotide Sequence (SEQ ID NO:24) 
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Table 7^:ncoded FCTR7 protein sequence ID NO:25). 
mkctarewlrvttvlf^iara^JRa/pnatllekllekymdedgewwiakqrgk^ 

twdvelersaesraesclwehgpasllpsigqniigahwgryrpptfhvqswydevkdfsypyehecnpycpfrcsgpv^ 

watsnrigcainlchnnnsriwgqiwpkavylvoffspkgnwwghapykhgrpcsacppsfgc^ 

i erqqsqvhdthvrtrsddssrnevi s fgksnen i mvle i lc 

This gene maps to Unigene cluster Hs. 182364 which has been assigned the following 
mapping information shown in table 7C. Therefore the chromosomal assignment for this gene is 
the same as that for Unigene cluster 182364. 

Table 7C. Mapping Information. 

Chromosome: 8 

Gene Map 98: Marker SHGC-32056 , Interval D8S279-D8S526 

Gene Map 98: Marker SGC32056 , Interval D8S526-D8S275 

Gene Map 98: Marker sts-G20223 , Interval D8S526-D8S275 

Gene Map 98: Marker stSG30385 , Interval D8S526-D8S275 

Whitehead map: EST67946, Chr.8 

dbSTS entries: G25853, G29349, G20223 

The predicted amino acid sequence was searched in the publicly available GenBank 
database 

FCTR7 protein showed Score = 743 (261.5 bits). Expect = L4e-73, P = 1.4e-73, 54 % 
identities (129 over 237 amino acids) and 43% homologies (167 over 237 amino acids) with 
human 25 kD trypsin inhibitor protem (258 aa; ACC:043692) (Table 7D). 

Table 7D. BLAST X search results are shown below: 

ptnr :SPTREMBL-ACC:043692 25 KDA TRYPSIN INHIBITOR - HO... +2 743 8.4e-73 1 (SEQ 
ID NO: 88) 

ptnr :SPTREMBL-ACC: 044228 HRTT-1 - HALOCYNTHIA RORETZI ... +2 325 2.9e-28 1 (SEQ 
ID NO:89) 

ptnr:SWISSPROT-ACC:P48060 GLIOMA PATHOGENESIS -RELATED ... +2 314 5.3e-27 1 (SEQ 
ID NO: 90) 

ptnr :PIR-ID: JC4131 glioma pathogenesis-related protein... +2 309 2.0e-26 1 (SEQ 
ID NO: 91) 

ptnr :SWISSNEW-ACC:O19010 CYSTEINE-RICH SECRETORY PROTE . . . +2 258 9.4e-21 1 (SEQ 
ID NO: 92) 

The nucleotide sequence of FCTR7 has 954 of 957 residues (99 %) identical to the 1-957 
base segment, and 174 of 175 residues (99%) identical to bases 1317-1953 of the 2664 
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nucleotide Homo sapieu^anidXiv^ secretory protein precursor, nj|BL\ (GenBank-ACC: 
AF142573) (SEQ ID >W3) (Table 7E). ^ 

Table 7E. BLASTN of FCTR7 against Putative secretory protein precursor (SEQ ID 

NO:93) 

> qi 1 12002310 |qb| AF142573 . 1 1 AF142573 Homo sapiens putative secretory protein 
precursor, mRNA, complete cds 
Length = 2664 

Score = 1865 bits (941), Expect = 0.0 
Identities = 954/957 (99%), Gaps = 1/957 (0%) 
Strand = Plus / Plus 

gtccggtttggctcacctctcccaggaaacttcacactggagagccaaaaggagtggaag 423 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

gtccggtttggctcacctctcccaggaaacttcacactggagagccaaaaggagtggaag 60 
agcctgtcttggagattttcctggggaaatcctgaggtcattcattatgaagtgtaccgc 483 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

agcctgtcttggagattttcctggggaaatcctgaggtcattcattatgaagtgtaccgc 120 
gcgggagtggctcagagtaaccacagtgctgttcatggctagagcaattccagccatggt 543 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

gcgggagtggctcagagtaaccacagtgctgttcatggctagagcaattccagccatggt 180 
ggttcccaatgccactttattggagaaacttttggaaaaatacatggatgaggatggtga 603 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

ggttcccaatgccactttattggagaaacttttggaaaaatacatggatgaggatggtga 240 
gtgg tggatagccaaacaacgagggaaaagggccatcacagacaa tgacatgcagagtat 663 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

gtgg tggatagccaaacaacgagggaaaagggccatcacagacaa tgacatgcagagtat 3 00 
tttggaccttcataataaattacgaagtcaggtgtatccaacagcctctaatatggagta 723 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

tttggaccttcataataaattacgaagtcaggtgtatccaacagcctctaatatggagta 360 
tatgacatgggatgtagagctggaaagatctgcagaatccagggctgaaa-ttgcttgtg 782 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii iiiiiiiii iiiiiiiii 

tatgacatgggatgtagagctggaaagatctgcagaatcctgggctgaaagttgcttgtg 420 
ggaacatggacctgcaagcttgcttccatcaattggacagaatttgggagcacactgggg 842 

liiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

ggaacatggacctgcaagcttgcttccatcaattggacagaatttgggagcacactgggg 480 
aagatataggcccccgacgtttcatgtacaatcgtggtatgatgaagtgaaagactttag 902 

liiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

aagatataggcccccgacgtttcatgtacaatcgtggtatgatgaagtgaaagactttag 540 
ctacccatatgaacatgaatgcaacccatattgtccattcaggtgttctggccctgtatg 962 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

ctacccatatgaacatgaatgcaacccatattgtccattcaggtgttctggccctgtatg 600 
tacacattatacacaggtcgtgtgggcaactagtaacagaatcggttgtgccattaattt 1022 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

tacacattatacacaggtcgtgtgggcaactagtaacagaatcggttgtgccattaattt 660 
gtgtcataacatgaacatctgggggcagatatggcccaaagctgtctacctggtgtgcaa 1082 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

gtgtcataacatgaacatctgggggcagatatggcccaaagctgtctacctggtgtgcaa 720 
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S^uci. y . 


364 


Sbj Ct : 






•» ^ 


qVj-i pit- . 




Query i 


•x O rt 


Sbj ct : 


121 


y . 


J 1 


Sbj ct : 


181 


Ou^rv • 

S^iu^j. y . 


604 


Sbj Ct : 


241 


Ouerv • 


664 


Sbjct : 


301 


Query : 


724 


Sbjct : 


361 


Query : 


783 


Sbjct : 


421 


Query : 


843 


Sbjct : 


481 


Query : 


903 


Sbjct: 


541 


Query : 


963 


Sbjct: 


601 


Query: 


102 


Sbjct: 


661 



25 



30 



45 



Query: 1083 ttactc^^^aagggaaactggtggggccatgccccttc^^HLcatgggcggccctgttc 1142 

iiiiiiWiiiiiiiiiiiiiiiiiiiiiiiiiiiiiirTTiiiiiiiiiiiiiiiiii 

Sbjct : 721 ttactccccaaagggaaactggtggggccatgccccttacaaacatgggcggccctgttc 780 

5 

Query: 1143 tgcttgcccacctagttttggagggggctgtagagaaaatctgtgctacaaagaagggtc 12 02 

IIIIMIIIIIIIIIIilllllllllllllllllllllllllllllllllllllllllll 

Sbjct : 781 tgcttgcccacctagttttggagggggctgtagagaaaatctgtgctacaaagaagggtc 840 
10 Query: 1203 agacaggtattatccccctcgagaagaggaaacaaatgaaatagaacggcagcagtcaca 1262 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii iiiiiiiiiii 

Sbjct : 841 agacaggtattatccccctcgagaagaggaaacaaatgaaatagaacgacagcagtcaca 900 
Query: 1263 agtccatgacacccatgtccggacaagatcagatgatagtagcagaaatgaagtcat 1319 

15 llllll MM I II I IIIIIIIIIII II II I III I UN III MM I IIIIIIIIIII 

Sbjct : 901 agtccatgacacccatgtccggacaagatcagatgatagtagcagaaatgaagtcat 957 

Score = 339 bits (171), Expect = 3e-90 
Identities = 174/175 (99%) 
20 Strand = Plus / Plus 



Query : 


1317 


Sbjct : 


1779 


Query: 


1377 


Sbjct : 


1839 


Query : 


1437 


Sbjct : 


1899 



llllllllillllllllllllllllllllllllllllllllllllMIIIIIIIIIIIII 



gctatattttcttagcagttatttctacagttaattacatagtcatgattgttctacgtt 1436 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 

gctatattttcttagcagttatttctacagttaattacatagtcatgattgttctacgtt 1898 



IIMIIIIIIIIIIIIIIIIIIIIIII llllllllllllllllllllllll 



The FCTR7 amino acid has 284 of 285 amino acid residues (99%) identical to, and 284 
35 of 285 amino acid residues (99%) similar to, the 500 amino acid Putative secretory protein 
precursor [Homo sapiens] (GenBank-Acc No.: AF 142573) (SEQ ID NO:94) (Table 7F). 

Table 7F, BLASTP alignments of FCTR7 against Putative secretory protein precursor, 

(SEQ ID NO:94) 

> cri 1 12002311 1 qb| AAG43287 . 1 1 AF142573_1 {AF142573) putative secretory protein 
40 precursor [Homo sapiens] 
Length = 500 



Score = 581 bits (1499), Expect = e-165 
Identities = 284/285 (99%), Positives = 284/285 (99%) 

MKCTAREWLRVTTVLFMAIU^IPAIWVPNATLLEKIjLEKYMDEDGEWWIAKQRGK^ 6 0 

MIIIIIMIIIIMIilllllllllllllllMllllllllillMIIIIIIIIIIIII 

MKCTAREWLRVTTVLFMARAIPAMWPNATLLEKXLEKYMDEDGEWWIAKQRGKRAITD 6 0 
50 Query: 61 DMQSILDLHNKLRSQVYPTASNMEYMTWDVELERSAESRAESCLWEHGPASLLPSIGQNL 120 

IIIIIIIIIIIIMIIIIIIilllMIIMIMIIIM illllMIIMIIIIIMMI 

DMQSILDLHNKLRSQWPTASl^EYMTWDVELERSAESWAESCLWEHGPASLLPSIGQNL 120 



Query : 


1 


Sbjct: 


1 


Query: 


61 


Sbjct: 


61 


Query : 


121 


Sbjct: 


121 


Query: 


181 



GAHWGRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQVVWATSNRIG 180 

^3 ~ M I M 1 1 1 1 1 1 1 M M I M 1 1 1 M 1 1 1 1 M M 1 1 M 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 

GAHWGRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQVVWATSNRIG 180 
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30 



45 



I M 1 1 1 1 Jill 1 1 1 1 1 M 1 1 1 1 1 1 i 1 1 M I M M I i M I lii^l 1 1 1 M 1 1 1 M 1 1 1 1 1 

Sbjct: 181 CAI^^^CH^^WGQIWPKAVyLVCNYSPKGNWWGHAPYKH^■SACPPSFGGGCRENLC 240 



Query: 241 YKEGSDRYYPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVIS 285 

3 illllMIIMIIIIIIillMIIIIIIIIMIMIIMIIIIII 

Sbjct: 241 YKEGSDRYYPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVIS 285 

The FCTR7 amino acid has 137 of 176 amino acid residues (78%) identical to, and 151 
of 176 amino acid residues (86%) similar to, the 188 amino acid Late gestation lung protein 1 
10 [Rattus norvegicus] (GenBank-Acc No.: AF109674) (SEQ ID NO:95) (Table 7G). 

Table 7G. BLASTP alignments of FCTR7 against Late gestation lung protein 1, (SEQ ID 

NO:95) 

> Qi|4324682 |qb|AAD16986 .1 1 (AF109674) late gestation lung protein 1 [Rattus 
norvegicus] 
15 Length = 188 

Score = 277 bits (709) , Expect = le-73 

Identities = 137/176 (78%) , Positives = 151/176 (86%) 



Query: 


68 


Sbjct : 


2 


Query: 


128 


Sbjct : 


62 


Query: 


188 


Sbjct: 


122 



mill MM llllllllll llllll I lllllllllll llllll Mill 

LHNKLRGQVYPPASNMEYMTWDEELERSAAAWAQRCLWEHGPASLLVS IGQNLAVHWGRY 6 1 



25 I I lllllllllllk+lll llllhll MM +IMIIhMIIHHMh+ I 

RSPGFHVQSWYDEVKDYTYPYPHECNPWCPERCSGAMCTHYTQMVWATTNKIGCAVHTCR 121 



+ M IIIIIIIIMMM I llllllllll II kllMI IIIM 



The FCTR7 amino acid has 130 of 237 amino acid residues (55%) identical to, and 165 
of 237 amino acid residues (70%) similar to, the 258 amino acid R3H domain-containing 
preproprotein; 25 kDa trypsin inhibitor [Homo sapiens] (GenBank-Acc No.: D45027) (SEQ ID 
35 NO:96) (Table 7H). 

Table 7H. BLASTP alignments of FCTR7 against R3H domain-containing preproprotein, 

25 kDa trypsin inhibitor (SEQ ID NO:96) 

> cri I 7705676 I ref | NP_056 970 . 1 1 R3H domain- containing preproprotein; 25 kDa 
40 trypsin inhibitor/ R3H 

domain (binds single- stranded nucleic acids) containing 
[Homo sapiens] 

Qi|2943716|dbi |BAA25066.i| (D45027) 25 kDa trypsin inhibitor [Homo sapiens] 
Length = 258 



Score = 265 bits (678) , Expect = 4e-70 

Identities = 130/237 (55%), Positives = 165/237 (70%), Gaps = 3/237 (1%) 



Query: 12 TTVLFMARAIPAMWPNATLLEKLLEKYMDEDGEVmiAKQRGKRAITDNDMQSILDL^^ 71 

50 +11+ + + I I ^1 1^ ^1 I I I II 1+ III ^111 11^ 

Sbjct: 20 STWLLNSTDSSPPTNNFTDIEAALKAQLDSAD IPKARRKRYXSQNDMIAILDYHNQ 76 
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Query: 72 LRSQVYPTASpiEYMTWDVELERSAESRAESCLWEHGPASM^IGQN^AHWGRYRPPT 131 

H HhAiIM M I Ml I hl^Mk BHIil MM 

Sbjct: 77 VRGKVFPflBfMEYIWWDENIiAKSAEAWAATCIWDHGPSYLER^ 136 

5 Query: 132 FHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQVWATSNRIGCAINLCHNl^I 191 

1+ IMIMh++M HII M M IhMMMHMIMIMIM^ I Mk 

Sbjct: 137 QLVKPWYDEVKDYAFPYPQDCNPRCPMRCFGPMCTHYTQMVWATSNRIGCAIHTCQNMNV 196 
Query: 192 WGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY 248 

10 M +1 +MMMM + MMI I MM I MhMM + M I +MK + I 

Sbjct: 197 WGSVWRRAVYLVCNYAPKGNWIGEAPYKVGVPCSSCPPSYGGSCTDNLCFPGVTSNY 253 

The FCTR7 amino acid has 109 of 233 amino acid residues (47%) identical to, and 146 
of 233 amino acid residues (63%) similar to, the 253 amino acid Novel protein similar to a 
1 5 trypsin inhibitor [Homo sapiens] 25 kDa trypsin inhibitor (EMBLAcc No.: ALl 1 7382) (SEQ ID 
NO:97) (Table 71). 

Table 71. BLAST? alignments of FCTR7 against Novel protein similar to a trypsin 

inhibitor, (SEQ ID NO:97) 

20 

> ai I 9885193 | emb | CAC04190 . 1 1 (AL117382) dJ881L22.3 (novel protein similar to a 
trypsin 

inhibitor) [Homo sapiens] 
Length = 253 

25 

Score = 225 bits (575), Expect = 4e-58 

Identities = 109/233 (47%), Positives = 146/233 (63%), Gaps = 8/233 (3%) 

Query: 10 RVTTVLFMARAIPAMWPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITO 69 

30 -H I I I II I +1 + I + + III h II +HI I 

Sbjct: 19 QAVNALIMPNATPAPAQPESTAMRLL SGLEVPRYRRKRHISVRDMNALLDYH 70 

Query: 70 NKLRSQVYPTASNMEYMTWDVELERSAE SRAE S CLWEHG PAS LLPS IGQNLGAHWGRYRP 12 9 

I +1+ III hlllll II I I II I hi llh t+ +IIII 11 + 11 
35 Sbjct: 71 NHIRASVYPPAANMEYMWDKRLARAAEAWATQCIWAHGPSQLl^WGQNLSIHSGQTO 130 

Query: 130 PTFHVQSWTOEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQVVWATSNRIGCAINLCHNN 189 

++II H + H Hll + lhll II hllll + llkllKMIh I 

Sbjct: 131 VVDLMKSWSEEKWHYLFPAPRDCNPHCPWRCDGPTCSHYTQMVWASSNRLGCAIHTCSSI 190 



40 



45 



Query: 190 NIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYK 242 

++II I +1 milk MM I +11! hllhlllh I I KM 

Sbjct: 191 SVWGNTWHRAAYLVCNYAIKGNWIGESPYKMGKPCSSCPPSYQGSCNSNMCFK 243 



The FCTR7 amino acid has 129 of 237 amino acid residues (54%) identical to, and 167 
of 237 amino acid residues (70%) similar to, the 258 amino acid 25 kDa Trypsin Inhibitor from 
Homo sapiens (EMBLAcc No.: 043692) (SEQ ID NO:88) (Table 7J). 

Table 7J. BLASTP alignments of FCTR7 against 25 kDa Trypsin Inhibitor, (SEQ ID 
50 NO:88) 
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ptnr : SPTREMBL- A£C ; 043 6 92 25 KDA TRYPSIN INHIBITOR 





Lo sapiens (Human) , 258 



aa. 



Score = 743 (261.5 bits), Expect = 1.6e-73, P ^ 1.6e-73 
Identities = 129/237 (54%), Positives = 167/237 (70%) 

The FCTR7 amino acid has 79 of 193 amino acid residues (40%) identical to, and 1 10 of 
193 amino acid residues (56%) similar to, the 266 amino acid Glioma Pathogenesis-Related 
Protein (RTVP-1 Protein) - Homo sapiens (SWISSPROT Acc No.: P48060) (SEQ ID NO:90) 
(Table 7K). 

Table 7K. BLASTP alignments of FCTR7 against Glioma Pathogenesis-Related Protein, 



ptnr:SWISSPROT-ACC:P48060 GLIOMA PATHOGENESIS-RELATED PROTEIN (RTVP-1 PROTEIN) 
- Homo sapiens (Human), 266 aa 

Score = 314 (110.5 bits). Expect = 4.7e-28, P = 4.7e-28 
Identities = 79/193 (40%) , Positives = 110/193 (56%) 



The FCTR7 amino acid has 66 of 186 amino acid residues (35%) identical to, and 91 of 
186 amino acid residues (48%) similar to, the 186 amino acid Neutrophil granules matrix 
glycoprotein SGP28 precursor from Homo sapiens (SWISSPROT Acc No.: S68691) (SEQ ID 
NO:98) (Table 7L). 

Table 7L. BLASTP alignments of FCTR7 against Neutrophil granules matrix glycoprotein, 



Score = 254 (89.4 bits). Expect = l.le-21, P = l.le-21 
Identities = 66/186 (35%), Positives = 91/186 (48%) 

A novel developmentally regulated gene with homology to a tumor derived trypsin 
inhibitor is expressed in lung mesenchyme, as described in Am. J. Physiol. 0:0-0(1999). cDNA 
cloning of a novel trypsin inhibitor v^th similarity to pathogenesis-related proteins, and its 
frequent expression in human brain cancer cells is disclosed in Biochim. Biophys. Acta 
1395:202-208(1998). RTVP-1, a novel human gene with sequence similarity to genes of diverse 
species, is expressed in timior cell lines of glial but not neuronal origin, as published in Gene 
180:125-130(1996). The human glioma pathogenesis-related protein is structurally related to 
plan pathogenesis-related proteins and its gene is expressed specifically in brain tumors (Gene 
159:131-135(1995)). Structure comparison of human glioma pathogenesis-related protein GliPR 
and the plant pathogenesis-related protein PI 4a indicates a ftmctional link between the human 



(SEQ ID NO:90) 



(SEQ ID NO:98) 



ptnr:PIR-ID:S68691 neutrophil granules matrix glycoprotein SGP28 precursor - 



human 



116 
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immune system and a defense system (Proc. Natl. Acad. Sj^tki.S.A. 95:2262-2266(1998)). 
GliPR is highly expresSRn the human brain tumor, glioblastomSnultiform/astrocytome, but 
neither in normal fetal or adult brain tissue, nor in other nervous system tumors. GliPR belongs 
to a family that groups manmialian SCP/TPXl; insects AG3/AG5; FUNGI SC7/SC14 and plants 
PR-1 . SGP28, a novel matrix glycoprotein in specific granules of human neutrophils v^th 
similarity to a human testis-specific gene product and to a rodent sperm-coating glycoprotein 
(FEBS Lett. 380, 246-250, 1996). The primary structure and properties of helothermine, a 
peptide toxin that blocks ryanodine receptors is described in Biophys. J. 68:2280-2288(1995). As 
GliPR, Helothermine belongs to a family that groups mammalian SCP/TPXl; insects AG3/AG5; 
FUNGI SC7/SC14 and plants PR-1 . 

Based upon homology, FCTR7 protein and each homologous protein or peptide may 
share at least some activity. 



Therapeutic uses: 

FCTR7 protein has homology to trypsin inhibitors, Q91055 helothermine, tumor derived 
tyrpsin inhibitors, glioma pathogenesis-related protein, Q9Z0U6 LATE GESTATION LUNG 
PROTEIN 1 , and to the Prosite family which groups mammalian SCP/TPXl ;INSECTS 
AG3/AG5; FUNGI SC7/SC14 AND PLANTS PR-1 proteins. Therefore the FCTR7 protein 
disclosed in this invention could function like the proteins which it has homology to. These 
functions include tissue development in vitro and in vivo, and cancer pathogenesis. 

Based the tissue expression pattem, the gene is implicated in diseases of tissues in which 
it is expressed. These diseases include but are not limited to: 

• Glioma, 

• cancer, 

• lung diseases, 

• gestation, 

• male and female reproductive diseases, 

• deafness, 

• neurological disorders, 

• gastric disorders, and 

• pancreatic diseases like diabetes. 



These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel FCTR7 substances for use in therapeutic or diagnostic methods. 
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These antibodies may ^^nerated according to methods knowj^fcdie art, using prediction from 
hydrophobicity charts, IRiescribed in the "Anti-FCTRX Antiboaies" section below. In one 
embodiment, a contemplated FCTR7 epitope is from aa 40 to 120. In another embodiment, a 
FCTR7 epitope is from aa 130 to 170. In additional embodiments, FCTR7 epitopes are from aa 
210 to 230, and from aa 240 to 280. 



TABLE 8A: Summary Of Nucleic Acids And Proteins Of The Invention 



Name 


Tables 


Clone; Description ofHomoIog 


Nucleic Acid 
SEQ ID NO 


Amino Acid 
SEQ ID NO 


FCTRl 


lA, IB, 


58092213.0.36 follistatin-like protein 


1 


2 


FCTR2 


2A, 2B 


AC012614_1.0.123; KIAA1061-like protein 


3 


4 


FCTR3 


3A, 3B 


10129612.0.118; neurestin-like protein 


5 


6 




3C, 3D 


10129612.0.405; neurestin-like protein 


7 


8 




3E 


10129612.0.154; neurestin-like protein 


9 






3F 


10129612.0.67; neurestin-like protein 


10 






3G 


10129612.0.258; neurestin-like protein 


11 






3H, 31 


10129612.0.352; neurestin-like protein 


12 


13 


FCTR4 


4A, 4B 


29692275.0.1; NF-Kappa-B P65delta3-like 
protein 


14 


15 


FCTR5 


5A, 5B 


32125243.0.21; human complement CIR 
component precursor -like protein 


16 


17 




5C, 5D 




18 


19 


FCTR6 


6A, 6B 


27455183.0.19; novel human blood 
coagulation factor XI -like protein 


20 


21 




6C, 6D 


27455183.0.145; novel human blood 
coagulation factor XI -like protein 


22 


23 


FCTR7 


7A, 7B 


32592466.0.64; trypsm inhibitor -like protein 


24 


25 


FCTRl 


Example 2 


Ag809 Forward 


26 




FCTRl 


Example 2 


Ag809 Probe 


27 




FCTRl 


Example 2 


Ag809 Reverse 


28 




FCTR4 


Example 2 


Ag2773 Forward 


29 




FCTR4 


Example 2 


Ag2773 Probe 


30 




FCTR4 


Example 2 


Ag2773 Reverse 


31 




FCTR5 


Example 2 


Ag427 Forward 


32 




FCTR5 


Example 2 


Ag427 Probe 


33 




FCTR5 


Example 2 


Ag427 Reverse 


34 




FCTR6 


Example 2 


Agl541 Forward 


35 




FCTR6 


Example 2 


Agl541 Probe 


36 




FCTR6 


Example 2 


Agl541 Reverse 


37 





TABLE 8B: Summary of Query Sequences Disclosed 



Table 


Database 


Acc. No, 


Sequence Name 


Species 


SEQ ID NO. 


IC, IK 


remtrEmbl 


BAA21725 


IGFBP-like protein 


mouse 


38 


ID 


sptrEmbI 


Q61581 


Follistatin-like protein-2 


Mouse 


39 



118 
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IE 


SptrEmbl ]^822 


Mac25 protein 


Human 


40 


IF, IK 


SptrEmbI " 


^812 


Mac25 protein 


Mouse 


41 


IG, IK 


SptrEmbl 


Q 16270 


Prostacyclin-stimulating factor 


Human 


42 


IH, IK 


PIR 


B40098 


Colorectal cancer suppressor 


Rat 


43 


11 


TrEmblne 
w 


AAD9360 


PTP sigma (brain) precursor 


Human 


44 


IJ 


SptrEmbl 


Q13332 


PTP sigma precursor 


Human 


45 


2C 


GenBank 


AB028984 


KIAA1061 cDNA 


Human 


46 


2D 


i llZflllUillC 
W 


DrvrVO JO / / 


KIAA1263 


Human 


47 


2E 


TrEmblne 
w 


BAA83013 


KIAA1061 protein fragment 


Human 


48 


2F 


Embl 


CAB70877.1 


Hypothetical protein DKFzp566D234.1 


Human 


49 


2G 


GenBank 


Q62632 


Follistatin-related protein- 1 precursor 


Rat 


50 


2H 


1 -ran t< o n I/' 




FoUistatin-related protein- 1 precursor 


Mouse 


51 




It An o 

vjcnoanK 


JKJSJ 1 o / 


iTUillDulllli ICIulCU piUlCUl 


A TTir'Jin 

clawed 
frog 




2J 


GenBank 


Q12841 


Follistatin related protein- 1 precursor 


Human 


53 


2K 


Embl 


CAB42968.1 


Flik protein 


Chicken 


54 


2L 


GenBank 


T13822 


Frazzled gene protein 


Fruit fly 


55 


2M 


GenBank 


AAC3 8849.1 


Roundabout 1 


Fruit fly 


56 


2N 


oenoanK 


yJOKJHOy 


Down Syndrome Cell Adhesion Molecule 
Precursor 


Human' 


57 


20 


oWlSSriOi 




Limbic system-associated membrane 
protein precursor 


Human 


58 


2P 


SptrEmbl 


O70246 


Putative neuronal cell adhesion molecule, 
short form 


Mouse 


59 


20 


SptrEmbl 


002869 


CHLAMP, Gl 1-isoform precursor 


Chicken 


60 


2R 


SwissProt 


Q62813 


Limbic system-associated membrane 
protein precursor 


Rat 


61 


3J 


GenBank 


NM_0 11856.2 


Odd Oz/ten-m homology 2 


Fruit fly 


62 


3K 


Embl 


AJ245711.1 


Teneurin-2 cDNA, short splice variant 


Chicken 


63 


3L 


GenBank 


AB032953 


KIAA 1127 cDNA 


Human 


64 


3M, 3U 


GenBank 


AB02541 1 


Ten-m2 cDNA 


Mouse 


65 


3N 


GenBank 


NM_020088.1 


Neurestin alpha cDNA 


Rat 


66 


30 


Embl 


GGA278031 


Teneurin-2 


Chicken 


67 


3P 


GenBank 


NP_035986.2 


Odd Oz/ten-m homology 2 


Fruit fly 


68 


3Q 


Embl 


CAC09416.1 


Teneurin-2 


Chicken 


69 


3R 


GenBank 


BAA77399.1 


Ten-m4 


Mouse 


70 


3S 


GenBank 


AB032953 


KIAA 1127 protein 


Human 


71 


3T 


GenBank 


AF086607 


Neurestin alpha 


Rat 


72 


4C 


SptrEmbl 


Q99233 


Hypothetical 10 kD protein 


Trypanos 
ome 


73 


4C 


SptrEmbl 


Q 16896 


GABA receptor subunit 




74 


4C 


SptrEmbl 


076473 


GABA receptor subunit 




75 


4C 


TrEmblne 
w 


AAD28317 


FI3J11.13 protein 




76 



119 
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Text p. 90 


SptrEmbl 


^^13 


NF-kappa B P65 delta 3 proj^^ 


Human 


77 


5E 


GenBank 


^[o07061.1 


Complement C 1 R-like proteWR 
precursor 


Human 


78 


5F 


GenBank 


NM_001733.1 


Complement component 1 , R 
subcomponent cDNA 


Human 


79 


5G 


GenBank 


AAF44349.1 


Complement CI R-like proteinase 
precursor 


Human 


80 


5H 


GenBank 


AAA5 185.1 


Complement C 1 R component precursor 


Human 


81 


6E 


GenBank 


AB046651 


Brain cDNA clone Qcc- 17034 


Macaque 


82 


6F 


GenBank 


AK09660 


Adult testis cDNA, RIKEN full length 
enriched 


Mouse 


83 


6G 


GenBank 


AB046651 


Hypothetical protein 


Macaque 


84 


6H 


GenBank 


NP_000838.1 


Plasma kallikreinBl precursor 


Human 


85 


61 


GenBank 


BAA37147.1 


Kallikrein 


Pig 


86 


6J 


Embl 


CAA64368.1 


Coagulation factor XI 


Human 


87 


7D,7J 


SptrEmbl 


043692 


25 kDa trypsin inhibitor 


Human 


88 


7D 


SptrEmbl 


044228 


HRTT-I 




89 


7D, 7K 


SptrEmbl 


P4 18060 


Glioma pathogenesis-related protein 


Human 


90 


7D 


PIR-ID 


JC4131 


Glioma pathogenesis-related protein 


Human 


91 


7D 


SwissProt 


019010 


Cysteine-rcih secretory protein 




92 


7E 


GenBank 


AF 142573 


Putatitive secretory protein precursor 
cDNA 


Human 


93 


7F 


GenBank 


AF 142573 


Putative secretory protein precursor 


Human 


94 


7G 


GenBank 


AF 109674 


Late gestation lung protein 1 


Rat 


95 


7H 


GenBank 


D45027 


R3H domain containing preprotein, 25 
kDa trypsin inhibitor 


Human 


96 


71 


Embl 


ALl 17382 


Novel protein similar to a trypsin 
inhibitor 


Human 


97 


7L 


PIR-ID 


S68691 


Neutrophil granules matrix glycoprotein 
SGP28 precursor 


Human 


98 



FCTRX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
FCTRX polypeptides or biologically-active portions thereof. Also included in the invention are 
nucleic acid fragments sufficient for use as hybridization probes to identify FCTRX-encoding 
nucleic acids (e.g., FCTRX mRNAs) and fragments for use as PCR primers for the amplification 
and/or mutation of FCTRX nucleic acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA), RNA 
molecules (e.g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and 
derivatives, fi*agments and homologs thereof. The nucleic acid molecule may be single-stranded 
or double-stranded, but preferably is comprised double-stranded DNA. 
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An FCTRX nu^k acid can encode a mature FCTRX pQ|^eptide. As used herein, a 
"mature" form of a pol^l^tide or protein disclosed in the presennnvention is the product of a 
naturally occurring polypeptide or precursor form or proprotein. The naturally occurring 
polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full length 
5 gene product, encoded by the corresponding gene. Alternatively, it may be defined as the 
polypeptide, precursor or proprotein encoded by an ORF described herein. The product 
"mature" form arises, again by way of nonlimiting example, as a result of one or more naturally 
occurring processing steps as they may take place within the cell, or host cell, in which the gene 
product arises. Examples of such processing steps leading to a "mature" form of a polypeptide 

10 or protein include the cleavage of the N-terminal methionine residue encoded by the initiation 
codon of an ORF, or the proteolytic cleavage of a signal peptide or leader sequence. Thus a 
mature form arising from a precursor polypeptide or protein that has residues 1 to N, where 
residue 1 is the N-terminal methionine, would have residues 2 through N remaining after 
removal of the N-terminal methionine. Alternatively, a mature form arising fi-om a precursor 

15 polypeptide or protein having residues 1 to N, in which an N-terminal signal sequence from 
residue 1 to residue M is cleaved, would have the residues from residue M+l to residue N 
remaining. Further as used herein, a "mature" form of a polypeptide or protein may arise from a 
step of post-translational modification other than a proteolytic cleavage event. Such additional 
processes include, by way of non-limiting example, glycosylation, myristoylation or 

20 phosphorylation. In general, a mature polypeptide or protein may result from the operation of 
only one of these processes, or a combination of any of them. 

The term "probes", as utilized herein, refers to nucleic acid sequences of variable length, 
preferably between at least about 10 nucleotides (nt), 100 nt, or as many as approximately, e.g., 
6,000 nt, depending upon the specific use. Probes are used in the detection of identical, similar, 

25 or complementary nucleic acid sequences. Longer length probes are generally obtained from a 
natural or recombinant source, are highly specific, and much slower to hybridize than shorter- 
length oligomer probes. Probes may be single- or double-stranded and designed to have 
specificity in PGR, membrane-based hybridizafion technologies, or ELISA-like technologies. 
The term "isolated" nucleic acid molecule, as utilized herein, is one which is separated 

30 from other nucleic acid molecules which are present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic acid 
(/.e., sequences located at the 5*- and 3'-termini of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various embodiments, the 
isolated FCTRX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 
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genomic DNA of the clWissue from which the nucleic acid is deffved (e.g., brain, heart, liver, 
spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material or culture medium when produced by recombinant 
5 techniques, or of chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ ID N0S:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, or a 
complement of this aforementioned nucleotide sequence, can be isolated using standard 
molecular biology techniques and the sequence information provided herein. Using all or a 
10 portion of the nucleic acid sequence of SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, 
and 24 as a hybridization probe, FCTRX molecules can be isolated using standard hybridization 
and cloning techniques (e.g., as described in Sambrook, et al,, (eds.), MOLECULAR CLONING: A 
Laboratory Manual 2"^* Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989; and Ausubel, et aL, (eds.). Current Protocols in Molecular Biology, John Wiley & 
15 Sons, New York, NY, 1993.) 

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR 
amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector 
and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to 
20 FCTRX nucleotide sequences can be prepared by standard synthetic techniques, e.g., using an 
automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide residues, 
which oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction. 
A short oligonucleotide sequence may be based on, or designed firom, a genomic or cDNA 
25 sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or 

complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions 
of a nucleic acid sequence having about 10 nt, 50 nt, or 100 nt in length, preferably about 15 nt 
to 30 nt in length. In one embodiment of the invention, an oligonucleotide comprising a nucleic 
acid molecule less than 100 nt in length would fiirther comprise at least 6 contiguous nucleotides 
30 of SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, or a complement thereof 
Oligonucleotides may be chemically synthesized and may also be used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID N0S:1, 
3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, or a portion of this nucleotide sequence (e.g., a 
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nucleotide sequence shown in SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, is 
one that is sufficiently complementary to the nucleotide sequence shown in SEQ ID N0S:1, 3, 5, 
7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, that it can hydrogen bond with little or no mismatches 
to the nucleotide sequence shown in SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 
24, thereby forming a stable duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base 
pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means the 
physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van der 
Waals, hydrophobic interactions, and the like. A physical interaction can be either direct or 
indirect. Indirect interactions may be through or due to the effects of another polypeptide or 
compound. Direct binding refers to interactions that do not take place through, or due to, the 
effect of another polypeptide or compound, but instead are without other substantial chemical 
intermediates. 

Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 
acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific hybridization 
in the case of nucleic acids or for specific recognition of an epitope in the case of amino acids, 
respectively, and are at most some portion less than a fiill length sequence. Fragments may be 
derived fi-om any contiguous portion of a nucleic acid or amino acid sequence of choice. 
Derivatives are nucleic acid sequences or amino acid sequences formed from the native 
compounds either directly or by modification or partial substitution. Analogs are nucleic acid 
sequences or amino acid sequences that have a structure similar to, but not identical to, the native 
compound but differs from it in respect to certain components or side chains. Analogs may be 
synthetic or from a different evolutionary origin and may have a similar or opposite metabolic 
activity compared to wild type. Homologs are nucleic acid sequences or amino acid sequences 
of a particular gene that are derived from different species. 

Derivatives and analogs may be full length or other than full length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, molecules 
comprising regions that are substantially homologous to the nucleic acids or proteins of the 
invention, in various embodiments, by at least about 70%, 80%, or 95% identity (with a 
preferred identity of 80-95%) over a nucleic acid or amino acid sequence of identical size or 
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when compared to an a^«|ed sequence in which the alignment ^^le by a computer homology 
program known in the S^r whose encoding nucleic acid is capame of hybridizing to the 
complement of a sequence encoding the aforementioned proteins under stringent, moderately 
stringent, or low stringent conditions. See e.g. Ausubel, et al.. Current PROTOCOLS IN 
Molecular Biology, John Wiley & Sons, New York, NY, 1993, and below. 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those sequences 
coding for isoforms of FCTRX polypeptides. Isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. Alternatively, 
isoforms can be encoded by different genes. In the invention, homologous nucleotide sequences 
include nucleotide sequences encoding for an FCTRX polypeptide of species other than humans, 
including, but not limited to: vertebrates, and thus can include, e.g., frog, mouse, rat, rabbit, dog, 
cat cow, horse, and other organisms. Homologous nucleotide sequences also include, but are not 
limited to, naturally occurring allelic variations and mutations of the nucleotide sequences set 
forth herein. A homologous nucleotide sequence does not, however, include the exact nucleotide 
sequence encoding human FCTRX protein. Homologous nucleic acid sequences include those 
nucleic acid sequences that encode conservative amino acid substitutions (see below) in SEQ ID 
N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, as well as a polypeptide possessing FCTRX 
biological activity. Various biological activities of the FCTRX proteins are described below. 

An FCTRX polypeptide is encoded by the open reading frame ("ORF") of an FCTRX 
nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop 
codon. An ORF that represents the coding sequence for a frill protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or TGA. 
For the purposes of this invention, an ORF may be any part of a coding sequence, with or 
without a start codon, a stop codon, or both. For an ORF to be considered as a good candidate 
for coding for a bona fide cellular protein, a minimum size requirement is often set, e.g., a stretch 
of DNA that would encode a protein of 50 amino acids or more. 

The nucleotide sequences determined from the cloning of the human FCTRX genes 
allows for the generation of probes and primers designed for use in identifying and/or cloning 
FCTRX homologues in other cell types, e.g, from other tissues, as well as FCTRX homologues 
from other vertebrates. The probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that 
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hybridizes under stringj^onditions to at least about 12, 25, 5(^|p, 150, 200, 250, 300, 350 or 
400 consecutive sense 5Hnd nucleotide sequence of SEQ ID NOSh, 3, 5, 7, 9, 10, 11, 12, 14, 
16, 18, 20, 22, and 24; or an anti-sense strand nucleotide sequence of SEQ ID N0S:1, 3, 5, 7, 9, 
10, 1 1, 12, 14, 16, 18, 20, 22, and 24; or of a naturally occurring mutant of SEQ ID N0S:1, 3, 5, 
7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24. 

Probes based on the human FCTRX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 
embodiments, the probe further comprises a label group attached thereto, e.g. the label group can 
be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can 
be used as a part of a diagnostic test kit for identifying cells or tissues which mis-express an 
FCTRX protein, such as by measuring a level of an FCTRX-encoding nucleic acid in a sample of 
cells from a subject e.g., detecting FCTRX mRNA levels or determining whether a genomic 
FCTRX gene has been mutated or deleted. 

"A polypeptide having a biologically-active portion of an FCTRX polypeptide" refers to 
polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 
polypeptide of the invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically- 
active portion of FCTRX" can be prepared by isolating a portion of SEQ ID NOS: 1, 3, 5, 7, 9, 
10, 1 1, 12, 14, 16, 18, 20, 22, and 24, that encodes a polypeptide having an FCTRX biological 
activity (the biological activities of the FCTRX proteins are described below), expressing the 
encoded portion of FCTRX protein (e.g., by recombinant expression in vitro) and assessing the 
activity of the encoded portion of FCTRX. 

FCTRX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the nucleotide 
sequences shown in SEQ ID N0S:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, due to 
degeneracy of the genetic code and thus encode the same FCTRX proteins as that encoded by the 
nucleotide sequences shown in SEQ ID NO N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 
24. In another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide 
sequence encoding a protein having an amino acid sequence shown in SEQ ID N0S:2, 4, 6, 8, 
13, 15, 17, 19, 21, 23, and 25. 

In addition to the human FCTRX nucleotide sequences shown in SEQ ID N0S:1, 3, 5, 7, 
9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, it will be appreciated by those skilled in the art that 
DNA sequence polymorphisms that lead to changes in the amino acid sequences of the FCTRX 
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natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer to 
nucleic acid molecules comprising an open reading frame (ORF) encoding an FCTRX protein, 
preferably a vertebrate FCTRX protein. Such natural allelic variations can typically result in 
1-5% variance in the nucleotide sequence of the FCTRX genes. Any and all such nucleotide 
variations and resulting amino acid polymorphisms in the FCTRX polypeptides, which are the 
result of natural allelic variation and that do not alter the functional activity of the FCTRX 
polypeptides, are intended to be v^thin the scope of the invention. 

Moreover, nucleic acid molecules encoding FCTRX proteins from other species, and thus 
that have a nucleotide sequence that differs from the human sequence of SEQ ID N0S:1, 3, 5, 7, 
9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, are intended to be within the scope of the invention. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of the FCTRX 
cDNAs of the invention can be isolated based on their homology to the human FCTRX nucleic 
acids disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention 
is at least 6 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 
18, 20, 22, and 24. In another embodiment, the nucleic acid is at least 10, 25, 50, 100, 250, 500, 
750, 1000, 1500, or 2000 or more nucleotides in length. In yet another embodiment, an isolated 
nucleic acid molecule of the invention hybridizes to the coding region. As used herein, the term 
"hybridizes under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 60% homologous to each other typically 
remain hybridized to each other. 

Homologs (Le., nucleic acids encoding FCTRX proteins derived from species other than 
human) or other related sequences {e.g., paralogs) can be obtained by low, moderate or high 
stringency hybridization v^th all or a portion of the particular human sequence as a probe using 
methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions under 
which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no other 
sequences. Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures than shorter 
sequences. Generally, stringent conditions are selected to be about 5°C lower than the thermal 
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melting point (Tm) for U^specific sequence at a defined ionic ^||^th and pH. The Tm is the 
temperature (under demBn ionic strength, pH and nucleic acid c^rcentration) at which 50% of 
the probes complementary to the target sequence hybridize to the target sequence at equilibrium. 
Since the target sequences are generally present at excess, at Tm, 50% of the probes are occupied 
5 at equilibrium. Typically, stringent conditions will be those in which the salt concentration is 
less than about 1 .0 M sodium ion, typically about 0.01 to 1.0 M sodium ion (or other salts) at 
pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or 
oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of destabilizing 

10 agents, such as formamide. 

Stringent conditions are known to those skilled in the art and can be found in Ausubel, et 
al, (eds.). Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 
85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each 

15 other. A non-limiting example of stringent hybridization conditions are hybridization in a high 
5 salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% 
Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA at 65°C, followed by one or 

vy more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the 

fil 

I " invention that hybridizes under stringent conditions to the sequences of SEQ ID NOS: 1,3,5,7, 

20 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, corresponds to a naturally-occurring nucleic acid 

Q molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or 

: =ii 
: 

T4 DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural 

'f- protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic acid 
25 molecule comprising the nucleotide sequence of SEQ ID N0S:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 
18, 20, 22, and 24, or fragments, analogs or derivatives thereof, under conditions of moderate 
stringency is provided. A non-limiting example of moderate stringency hybridization conditions 
are hybridization in 6X SSC, 5X DenhardVs solution, 0.5% SDS and 100 mg/ml denatured 
salmon sperm DNA at 55°C, followed by one or more washes in IX SSC, 0.1% SDS at 37X. 
30 Other conditions of moderate stringency that may be used are well-known within the art. See, 
e.g., Ausubel, et aL (eds.), 1993, CURRENT PROTOCOLS IN Molecular Biology, John Wiley & 
Sons, NY, and Kriegler, 1990; GENE Transfer and Expression, A Laboratory Manual, 
Stockton Press, NY. 
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In a third emboc^|^nt, a nucleic acid that is hybridizabiQ^the nucleic acid molecule 
comprising the nucleotlRequences of SEQ ID N0S:1, 3, 5, 7,^0, 11,12, 14, 16, 18, 20, 22, 
and 24, or fragments, analogs or derivatives thereof, imder conditions of low stringency, is 
provided. A non-limiting example of low stringency hybridization conditions are hybridization 
in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% 
Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 10% (v^vol) dextran sulfate at 
40°C, followed by one or more washes in 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 
0.1% SDS at 50°C. Other conditions of low stringency that may be used are well knovm in the 
art (e.g., as employed for cross-species hybridizations). See, e.g., Ausubel, et al, (eds.), 1993, 
Current Protocols in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, 
Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY; Shilo and 
Weinberg, \9U. Proc Natl Acad Sci USA 78: 6789-6792. 

Conservative Mutations 

In addition to naturally-occurring allelic variants of FCTRX sequences that may exist in 
the population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into the nucleotide sequences of SEQ ID NO N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 
20, 22, and 24, thereby leading to changes in the amino acid sequences of the encoded FCTRX 
proteins, without altering the functional ability of said FCTRX proteins. For example, nucleotide 
substitutions leading to amino acid substitutions at "non-essential" amino acid residues can be 
made in the sequence of SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25. A 
"non-essential" amino acid residue is a residue that can be altered from the wild-type sequences 
of the FCTRX proteins without altering their biological activity, whereas an "essential" amino 
acid residue is required for such biological activity. For example, amino acid residues that are 
conserved among the FCTRX proteins of the invention are predicted to be particularly non- 
amenable to alteration. Amino acids for which conservative substitutions can be made are well- 
known within the art. 

Another aspect of the invention pertains to nucleic acid molecules encoding FCTRX 
proteins that contain changes in amino acid residues that are not essential for activity. Such 
FCTRX proteins differ in amino acid sequence from SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 
23, and 25, yet retain biological activity. In one embodiment, the isolated nucleic acid molecule 
comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino 
acid sequence at least about 45% homologous to the amino acid sequences of SEQ ID N0S:2, 4, 
6, 8, 13, 15, 17, 19, 21, 23, and 25. Preferably, the protein encoded by the nucleic acid molecule 
is at least about 60% homologous to SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25; more 
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preferably at least abou^% homologous to SEQ ID N0S:2, 4^|, 13, 15, 17, 19, 21, 23, and 
25; still more preferablPReast about 80% homologous to SEqWnOS:2, 4, 6, 8, 13, 15, 17, 
19, 21, 23, and 25; even more preferably at least about 90% homologous to SEQ ID N0S:2, 4, 6, 
8, 13, 15, 17, 19, 21, 23, and 25; and most preferably at least about 95% homologous to SEQ ID 
5 NOS:2,4,6,8, 13, 15, 17, 19, 21, 23, and 25. 

An isolated nucleic acid molecule encoding an FCTRX protein homologous to the protein 
of SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, can be created by introducing one or 
more nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID 
N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, such that one or more amino acid 
10 substitutions, additions or deletions are introduced into the encoded protein. 

Mutations can be introduced into SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, 
by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are made at one or more predicted, 
non-essential amino acid residues. A "conservative amino acid substitution" is one in which the 
ill 15 amino acid residue is replaced with an amino acid residue having a similar side chain. Families 
?=l of amino acid residues having similar side chains have been defined v^thin the art. These 

families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side 
^[1 chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, 

glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, 

f=f 20 leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 

W 

f J (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 

f-- tryptophan, histidine). Thus, a predicted non-essential amino acid residue in the FCTRX protein 

is replaced with another amino acid residue from the same side chain family. Alternatively, in 
another embodiment, mutations can be introduced randomly along all or part of an FCTRX 
25 coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened 
for FCTRX biological activity to identify mutants that retain activity. Following mutagenesis of 
SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, the encoded protein can be expressed by 
any recombinant technology known in the art and the activity of the protein can be determined. 
The relatedness of amino acid families may also be determined based on side chain 
30 interactions. Substituted amino acids may be fully conserved "strong" residues or fully 

conserved "weak" residues. The "strong" group of conserved amino acid residues may be any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 
substituted for each other. Likewise, the "weak" group of conserved residues may be any one of 
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'EQHK, NEQHRK, 



VLIM, HF Y, wherein tlWetters within each group represent the smgle letter amino acid code. 

In one embodiment, a mutant FCTRX protein can be assayed for (/) the ability to form 
protein:protein interactions with other FCTRX proteins, other cell-surface proteins, or 
biologically-active portions thereof, (//) complex formation between a mutant FCTRX protein 
and an FCTRX ligand; or (///) the ability of a mutant FCTRX protein to bind to an intracellular 
target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant FCTRX protein can be assayed for the ability to 
regulate a specific biological function (e.g., regulation of insulin release). 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NOS:l, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, or fragments, 
analogs or derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that 
is complementary to a "sense" nucleic acid encoding a protein (e.g., complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence). 
In specific aspects, antisense nucleic acid molecules are provided that comprise a sequence 
complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire FCTRX 
coding strand, or to only a portion thereof Nucleic acid molecules encoding fragments, 
homologs, derivatives and analogs of an FCTRX protein of SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 
19, 21, 23, and 25; or antisense nucleic acids complementary to an FCTRX nucleic acid 



sequence of SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, are additionally 



In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence encoding an FCTRX protein. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are translated 
into amino acid residues. In another embodiment, the antisense nucleic acid molecule is 
antisense to a "noncoding region" of the coding strand of a nucleotide sequence encoding the 
FCTRX protein. The term "noncoding region" refers to 5' and 3' sequences which flank the 
coding region that are not translated into amino acids (/.e., also referred to as 5* and 3' 
untranslated regions). 

Given the coding strand sequences encoding the FCTRX protein disclosed herein, 
antisense nucleic acids of the invention can be designed according to the rules of Watson and 



provided. 
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antisense to only a portion of the coding or noncoding region of FCTRX mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of FCTRX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis or enzymatic ligation reactions using procedures known 
in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be 
chemically synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids (e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense nucleic acid 
include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, S-methoxyiuacil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 

3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding an FCTRX protein to thereby inhibit expression of the protein (e.g., by 
inhibiting transcription and/or translation). The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid 
molecule that binds to DNA duplexes, through specific interactions in the major groove of the 
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double helix. An ex; 




fif a route of administration of antisei 




[ucleic acid molecules of the 



invention includes dire^mjection at a tissue site. Altematively^mtisense nucleic acid 
molecules can be modified to target selected cells and then administered systemically. For 
example, for systemic administration, antisense molecules can be modified such that they 
specifically bind to receptors or antigens expressed on a selected cell surface (e.g., by linking the 
antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or 
antigens). The antisense nucleic acid molecules can also be delivered to cells using the vectors 
described herein. To achieve sufficient nucleic acid molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter 
are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific 
double-stranded hybrids vAth complementary RNA in which, contrary to the usual (i-units, the 
strands run parallel to each other. See, e.g., Gaultier, et aL, 1987. Nucl. Acids Res, 15: 
6625-6641. The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide 
(see, e.g., Inoue, et aL 1987. Nucl Acids Res. 15: 6131-6148) or a chimeric RNA-DNA analogue 
(see, e.g., Inoue, et a/., 1987. FEBS Lett. 215: 327-330. 

Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified bases, and 
nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the modified 
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 
therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. Ribozymes 
are catalytic RNA molecules v^th ribonuclease activity that are capable of cleaving a 
single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes as described in Haselhofif and Gerlach 1988. 
Nature 334: 585-591) can be used to catalytically cleave FCTRX mRNA transcripts to thereby 
inhibit translation of FCTRX mRNA. A ribozyme having specificity for an FCTRX-encoding 
nucleic acid can be designed based upon the nucleotide sequence of an FCTRX cDNA disclosed 
herein (/.e., SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24). For example, a 
derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence 
of the active site is complementary to the nucleotide sequence to be cleaved in an 
FCTRX-encoding mRNA. See, e.g., U.S. Patent 4,987,071 to Cech, et al and U.S. Patent 
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5,1 16,742 to Cech, et a^CTRX mRNA can also be used to sei^^ catalytic RNA having a 
specific ribonuclease aSWty from a pool of RNA molecules. Se^.g., Bartel et al, (1993) 
Science 261:1411-1418. 

Alternatively, FCTRX gene expression can be inhibited by targeting nucleotide 
5 sequences complementary to the regulatory region of the FCTRX nucleic acid (e.g., the FCTRX 
promoter and/or enhancers) to form triple helical structures that prevent transcription of the 
FCTRX gene in target cells. See, e.g., Helene, 1991 . Anticancer Drug Des, 6: 569-84; Helene, et 
al 1992. Ann. KY, Acad Sci. 660: 27-36; Maher, 1992. Bioassays 14: 807-15. 

In various embodiments, the FCTRX nucleic acids can be modified at the base moiety, 

10 sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility of 
the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be 
modified to generate peptide nucleic acids. See, e.g., Hyrup, et al., 1996. BioorgMed Chem 4: 
5-23. As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics 
(e.g., DNA mimics) in which the deoxyribose phosphate backbone is replaced by a 

1 5 pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup, et aL, 1996. supra; 
Perry-O'Keefe, e/a/., 1996. Proc. Natl Acad Sci. USA 93: 14670-14675. 

20 PNAs of FCTRX can be used in therapeutic and diagnostic applications. For example, 

PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs of 
FCTRX can also be used, for example, in the analysis of single base pair mutations in a gene 
(e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination 

25 v^th other enzymes, e.g.. Si nucleases (^ee, Hyrup, et al, \996.supra); or as probes or primers 
for DNA sequence and hybridization (^ee, Hyrup, et al., 1996, supra; Perry-O'Keefe, et al, 
1996. supra). 

In another embodiment, PNAs of FCTRX can be modified, e.g., to enhance their stability 
or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of 
30 PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in 
the art. For example, PNA-DNA chimeras of FCTRX can be generated that may combine the 
advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes 
(e.g., RNase H and DNA polymerases) to interact with the DNA portion while the PNA portion 
would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using 
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linkers of appropriate loMlhs selected in terms of base stacking^fcnber of bonds between the 
nucleobases, and orientlWn {see, Hyrup, etal., 1996. supra). TnSynthesis of PNA-DNA 
chimeras can be performed as described in Hyrup, et al, 1996. supra and Finn, et al.^ 1996. Nucl 
Acids Res 24: 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 
5*-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5' end of DNA. See, e.g., Mag, et a/., 1989. Nucl Acid Res 17: 5973-5988. PNA 
monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA 
segment and a 3' DNA segment. See, e.g., Finn, et al, 1996. supra. Alternatively, chimeric 
molecules can be synthesized with a 5' DNA segment and a 3' PNA segment. See, e.g., Petersen, 
etaL, 1975. Bioorg Med Chem. Lett. 5: 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
peptides (e.g., for targeting host cell receptors in v/vo), or agents facilitating transport across the 
cell membrane {see, e,g,, Letsinger, et al, 1989. Proc. Natl Acad. Sci. U.S.A. 86: 6553-6556; 
Lemaitre, et al, 1987. Proc. Natl. Acad Sci. 84: 648-652; PCT Publication No. WO88/09810) or 
the blood-brain barrier {see, e.g., PCT Publication No. WO 89/10134). In addition, 
oligonucleotides can be modified with hybridization triggered cleavage agents {see, e.g., Krol, et 
aL, 1988. BioTechniques 6:958-976) or intercalating agents {see, e.g., Zon, 1988. Pharm. Res. 5: 
539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, and the like. 

FCTRX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the amino acid 
sequence of FCTRX polypeptides whose sequences are provided in SEQ ID N0S:2, 4, 6, 8, 13, 
15, 17, 19, 21, 23, and 25. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residues shown in SEQ ID N0S:2, 4, 6, 8, 13, 
15, 17, 19, 21, 23, and 25, while still encoding a protein that maintains its FCTRX activities and 
physiological functions, or a functional fragment thereof 

In general, an FCTRX variant that preserves FCTRX-like function includes any variant in 
which residues at a particular position in the sequence have been substituted by other amino 
acids, and further include the possibility of inserting an additional residue or residues between 
two residues of the parent protein as well as the possibility of deleting one or more residues from 
the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed by the 
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invention. In favorabiejixumstances, the substitution is a consj^fctive substitution as defined 
above. 

One aspect of the invention pertains to isolated FCTRX proteins, and biologically-active 
portions thereof, or derivatives, fragments, analogs or homologs thereof Also provided are 
polypeptide fragments suitable for use as inunimogens to raise anti-FCTRX antibodies. In one 
embodiment, native FCTRX proteins can be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein purification techniques. In another 
embodiment, FCTRX proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an FCTRX protein or polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

An "isolated" or "purified" polypeptide or protein or biologically-active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the FCTRX protein is derived, or substantially free from chemical precursors 
or other chemicals v^hen chemically synthesized. The language "substantially free of cellular 
material" includes preparations of FCTRX proteins in which the protein is separated from 
cellular components of the cells from which it is isolated or recombinantly-produced. In one 
embodiment, the language "substantially free of cellular material" includes preparations of 
FCTRX proteins having less than about 30% (by dry weight) of non-FCTRX proteins (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non-FCTRX proteins, still more preferably less than about 10% of non-FCTRX proteins, and 
most preferably less than about 5% of non-FCTRX proteins. When the FCTRX protein or 
biologically-active portion thereof is recombinantly-produced, it is also preferably substantially 
free of culture medium, i.e., culture medium represents less than about 20%, more preferably less 
than about 10%, and most preferably less than about 5% of the volume of the FCTRX protein 
preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of FCTRX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 
language "substantially free of chemical preciirsors or other chemicals" includes preparations of 
FCTRX proteins having less than about 30% (by dry weight) of chemical precursors or 
non-FCTRX chemicals, more preferably less than about 20% chemical precursors or 
non-FCTRX chemicals, still more preferably less than about 10% chemical precursors or 
non-FCTRX chemicals, and most preferably less than about 5% chemical precursors or 
non-FCTRX chemicals. 
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Biologically-Mfij^ portions of FCTRX proteins includ^^ptides comprising amino acid 
sequences sufficientl^Bnologous to or derived from the amino acid sequences of the FCTRX 
proteins {e.g., the amino acid sequence shown in SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, 
and 25) that include fewer amino acids than the full-length FCTRX proteins, and exhibit at least 
one activity of an FCTRX protein. Typically, biologically-active portions comprise a domain or 
motif with at least one activity of the FCTRX protein. A biologically-active portion of an 
FCTRX protein can be a polypeptide which is, for example, 10, 25, 50, 100 or more amino acid 
residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, can be prepared by recombinant techniques and evaluated for one or more of the 
fimctional activities of a native FCTRX protein. 

In an embodiment, the FCTRX protein has an amino acid sequence shown in SEQ ID 
N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25. In other embodiments, the FCTRX protein is 
substantially homologous to SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, and retains 
the fimctional activity of the protein of SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, yet 
differs in amino acid sequence due to natural allelic variation or mutagenesis, as described in 
detail, below. Accordingly, in another embodiment, the FCTRX protein is a protein that 
comprises an amino acid sequence at least about 45% homologous to the amino acid sequence of 
SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, and retains the fimctional activity of the 
FCTRX proteins of SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25. 

Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic acids, 
the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the 
sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence). The amino acid residues or nucleotides at corresponding amino 
acid positions or nucleotide positions are then compared. When a position in the first sequence 
is occupied by the same amino acid residue or nucleotide as the corresponding position in the 
second sequence, then the molecules are homologous at that position {i.e., as used herein amino 
acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity 
between two sequences. The homology may be determined using computer programs known in 
the art, such as GAP sofiAvare provided in the GCG program package. See, Needleman and 
Wunsch, 1970. J Mol Biol 48: 443-453. Using GCG GAP software with the following settings 
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension penalty 
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of 0.3, the coding regioji^the analogous nucleic acid sequena 





ferred to above exhibits a 



degree of identity prefeTHry of at least 70%, 75%, 80%, 85%, 90^ 95%, 98%, or 99%, with the 
CDS (encoding) part of the DNA sequence shown in SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 
16, 18,20, 22, and 24. 

The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of positions 
at which the identical nucleic acid base (e,g,, A, T, C, G, U, or I, in the case of nucleic acids) 
occurs in both sequences to yield the number of matched positions, dividing the nimiber of 
matched positions by the total number of positions in the region of comparison (Le., the window 
size), and multiplying the result by 100 to yield the percentage of sequence identity. The term 
"substantial identity" as used herein denotes a characteristic of a polynucleotide sequence, 
wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, 
preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually 
at least 99 percent sequence identity as compared to a reference sequence over a comparison 
region. 

Chimeric and Fusion Proteins 

The invention also provides FCTRX chimeric or ftision proteins. As used herein, an 
FCTRX "chimeric protein" or "fusion protein" comprises an FCTRX polypeptide operatively- 
linked to a non-FCTRX polypeptide. An "FCTRX polypeptide" refers to a polypeptide having 
an amino acid sequence corresponding to an FCTRX protein (SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 
19, 21, 23, and 25), whereas a "non-FCTRX polypeptide" refers to a polypeptide having an 
amino acid sequence corresponding to a protein that is not substantially homologous to the 
FCTRX protein, e.g., a protein that is different from the FCTRX protein and that is derived from 
the same or a different organism. Within an FCTRX fusion protein the FCTRX polypeptide can 
correspond to all or a portion of an FCTRX protein. In one embodiment, an FCTRX fusion 
protein comprises at least one biologically-active portion of an FCTRX protein. In another 
embodiment, an FCTRX fusion protein comprises at least two biologically-active portions of an 
FCTRX protein. In yet another embodiment, an FCTRX fusion protein comprises at least three 
biologically-active portions of an FCTRX protein. Within the fusion protein, the term 
"operatively-linked" is intended to indicate that the FCTRX polypeptide and the non-FCTRX 
polypeptide are fused in-frame with one another. The non-FCTRX polypeptide can be fused to 
the N-terminus or C-terminus of the FCTRX polypeptide. 
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In one embodL 




the fusion protein is a GST-FCT] 




pn protein in which the 



FCTRX sequences are Wed to the C-terminus of the GST (glutamfone S-transferase) sequences. 
Such fusion proteins can facilitate the purification of recombinant FCTRX polypeptides. 

In another embodiment, the fusion protein is an FCTRX protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., manunalian host 
cells), expression and/or secretion of FCTRX can be increased through use of a heterologous 
signal sequence. 

In yet another embodiment, the fusion protein is an FCTRX-immunoglobulin fusion 
protein in which the FCTRX sequences are fused to sequences derived from a member of the 
immunoglobulin protein family. The FCTRX-immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an 
interaction between an FCTRX ligand and an FCTRX protein on the surface of a cell, to thereby 
suppress FCTRX-mediated signal transduction in vivo. The FCTRX-immunoglobulin fusion 
proteins can be used to affect the bioavailability of an FCTRX cognate ligand. Inhibition of the 
FCTRX ligand/FCTRX interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e.g. promoting or inhibiting) 
cell survival. Moreover, the FCTRX-inununoglobulin fusion proteins of the invention can be 
used as immunogens to produce anti-FCTRX antibodies in a subject, to purify FCTRX ligands, 
and in screening assays to identify molecules that inhibit the interaction of FCTRX with an 
FCTRX ligand. 

An FCTRX chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional techniques, 
e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline 
phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by conventional techniques including automated 
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using 
anchor primers that give rise to complementary overhangs between two consecutive gene 
fragments that can subsequently be aimealed and reamplified to generate a chimeric gene 
sequence (see, e.g., Ausubel, et al (eds.) CURRENT PROTOCOLS IN Molecular Biology, John 
Wiley & Sons, 1992). Moreover, many expression vectors are conmiercially available that 
already encode a fusion moiety {e.g., a GST polypeptide). An FCTRX-encoding nucleic acid 
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can be cloned into such^|^xpression vector such that the flision^^ety is linked in-frame to the 
FCTRX protein. 

FCTRX Agonists and Antagonists 

The invention also pertains to variants of the FCTRX proteins that function as either 
5 FCTRX agonists {i.e., mimetics) or as FCTRX antagonists. Variants of the FCTRX protein can 
be generated by mutagenesis {e.g., discrete point mutation or truncation of the FCTRX protein). 
An agonist of the FCTRX protein can retain substantially the same, or a subset of, the biological 
activities of the naturally occurring form of the FCTRX protein. An antagonist of the FCTRX 
protein can inhibit one or more of the activities of the naturally occurring form of the FCTRX 

10 protein by, for example, competitively binding to a downstream or upstream member of a 

cellular signaling cascade which includes the FCTRX protein. Thus, specific biological effects 
can be elicited by treatment with a variant of limited function. In one embodiment, treatment of 
a subject with a variant having a subset of the biological activities of the naturally occurring form 
of the protein has fewer side effects in a subject relative to treatment with the naturally occurring 

15 form of the FCTRX proteins. 

Variants of the FCTRX proteins that function as either FCTRX agonists {i.e., mimetics) 
or as FCTRX antagonists can be identified by screening combinatorial libraries of mutants {e.g., 
truncation mutants) of the FCTRX proteins for FCTRX protein agonist or antagonist activity. In 
one embodiment, a variegated library of FCTRX variants is generated by combinatorial 

20 mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A variegated 
library of FCTRX variants can be produced by, for example, enzymatically ligating a mixture of 
synthetic oligonucleotides into gene sequences such that a degenerate set of potential FCTRX 
sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion 
proteins {e.g., for phage display) containing the set of FCTRX sequences therein. There are a 

25 variety of methods which can be used to produce libraries of potential FCTRX variants from a 
degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
appropriate expression vector. Use of a degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set of potential FCTRX sequences. 

30 Methods for synthesizing degenerate oligonucleotides are well-known within the art. See, e.g., 
Narang, 1983. Tetrahedron 39: 3; Itakura, et aL, 1984. Annu. Rev. Biochem. 53: 323; Itakura, et 
ai, 1 984. Science 1 98: 1 056; Ike, et al, 1 983. Nucl. Acids Res. 1 1 : 477. 
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. Polypeptide Libmj^s 

In addition, libr^^ of fragments of the FCTRX protein coding sequences can be used to 
generate a variegated population of FCTRX fragments for screening and subsequent selection of 
variants of an FCTRX protein. In one embodiment, a library of coding sequence fragments can 
be generated by treating a double stranded PCR fragment of an FCTRX coding sequence with a 
nuclease imder conditions wherein nicking occurs only about once per molecule, denaturing the 
double stranded DNA, renaturing the DNA to form double-stranded DNA that can include 
sense/antisense pairs from different nicked products, removing single stranded portions from 
reformed duplexes by treatment with Si nuclease, and ligating the resulting fragment library into 
an expression vector. By this method, expression libraries can be derived which encodes 
N-terminal and internal fragments of various sizes of the FCTRX proteins. 

Various techniques are known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
products having a selected property. Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis of FCTRX proteins. The most widely 
used techniques, which are amenable to high throughput analysis, for screening large gene 
libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection of a desired activity facilitates isolation 
of the vector encoding the gene whose product was detected. Recursive ensemble mutagenesis 
(REM), a new technique that enhances the frequency of fimctional mutants in the libraries, can 
be used in combination with the screening assays to identify FCTRX variants. See, e.g., Arkin 
and Yourvan, 1992. Proc. Natl. Acad. Sci. USA 89: 781 1-7815; Delgrave, et al, 1993. Protein 
Engineering 6:327-33 1 . 

Anti-FCTRX Antibodies 

The invention encompasses antibodies and antibody fragments, such as Fab or (Fab)2, that 
bind immunospecifically to any of the FCTRX polypeptides of said invention. 

An isolated FCTRX protein, or a portion or fragment thereof, can be used as an 
immunogen to generate antibodies that bind to FCTRX polypeptides using standard techniques 
for polyclonal and monoclonal antibody preparation. The ftill-length FCTRX proteins can be 
used or, alternatively, the invention provides antigenic peptide fragments of FCTRX proteins for 
use as immunogens. The antigenic FCTRX peptides comprises at least 4 amino acid residues of 
the amino acid sequence shown in SEQ ID NO N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, and 
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encompasses an epitop^^FCTRX such that an antibody raisec^^inst the peptide forms a 
specific immune compreW^/ith FCTRX. Preferably, the antigem^eptide comprises at least 6, 8, 
10, 15, 20, or 30 amino acid residues. Longer antigenic peptides are sometimes preferable over 
shorter antigenic peptides, depending on use and according to methods well known to someone 
5 skilled in the art. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of FCTRX that is located on the surface of the protein (e.g., a 
hydrophilic region). As a means for targeting antibody production, hydropathy plots showing 
regions of hydrophilicity and hydrophobicity may be generated by any method well known in the 

1 0 art, including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or 

without Fourier transformation {see, e.g., Hopp and Woods, 1981. Proc. Nat. Acad ScL USA 78: 
3824-3828; Kyte and Doolittle, 1982. J. Mol Biol. 157: 105-142, each incorporated herein by 
reference in their entirety). 

As disclosed herein, FCTRX protein sequences of SEQ ID N0S:2, 4, 6, 8, 13, 15, 17, 19, 

15 21 , 23, and 25, or derivatives, fragments, analogs or homologs thereof, may be utilized as 
inmiunogens in the generation of antibodies that immunospecifically-bind these protein 
components. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically-active portions of immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site that specifically-binds (immunoreacts with) an antigen, such as FCTRX. 

20 Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain. 
Fab and F(ab')2 fragments, and an Fab expression library. In a specific embodiment, antibodies to 
human FCTRX proteins are disclosed. Various procedures known within the art may be used for 
the production of polyclonal or monoclonal antibodies to an FCTRX protein sequence of SEQ ID 
N0S:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, or a derivative, fragment, analog or homolog 

25 thereof Some of these proteins are discussed below. 

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be inmiunized by injection with the native protein, or a 
synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic 
preparation can contain, for example, recombinantly-expressed FCTRX protein or a chemically- 

30 synthesized FCTRX polypeptide. The preparation can further include an adjuvant. Various 

adjuvants used to increase the immunological response include, but are not limited to, Freund's 
(complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances 
(e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), 
human adjuvants such as Bacille Calmette-Guehn and Corynebacterium parvum, or similar 
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isolated from the mi 



immunostimulatory agi 




such as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody" or "monoclonal antibody composition", as used herein, 
refers to a population of antibody molecules that contain only one species of an antigen binding 
site capable of immunoreacting with a particular epitope of FCTRX. A monoclonal antibody 
composition thus typically displays a single binding affinity for a particular FCTRX protein with 
which it immunoreacts. For preparation of monoclonal antibodies directed towards a particular 
FCTRX protein, or derivatives, fragments, analogs or homologs thereof, any technique that 
provides for the production of antibody molecules by continuous cell line culture may be 
utilized. Such techniques include, but are not limited to, the hybridoma technique {see, e,g., 
Kohler & Milstein, 1975. Nature 256: 495-497); the trioma technique; the human B-cell 
hybridoma technique (see, e.g., Kozbor, et al, 1983. Immunol Today 4: 72) and the EBV 
hybridoma technique to produce human monoclonal antibodies {see, e.g., Cole, et al, 1985. In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human 
monoclonal antibodies may be utilized in the practice of the invention and may be produced by 
using human hybridomas {see, e.g., Cote, et al, 1983. Proc Natl Acad Sci USA 80: 2026-2030) 
or by transforming human B-cells with Epstein Barr Virus in vitro {see, e.g., Cole, et al, 1985. . 
In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Each of 
the aboye citations is incorporated herein by reference in their entirety. 

According to the invention, techniques can be adapted for the production of single-chain 
antibodies specific to an FCTRX protein {see, e.g., U.S. Patent No. 4,946,778). In addition, 
methods can be adapted for the construction of Fab expression libraries {see, e.g., Huse, et al, 
1989. Science 246: 1275-1281) to allow rapid and effective identification of monoclonal Fab 
fragments with the desired specificity for an FCTRX protein or derivatives, fragments, analogs 
or homologs thereof. Non-human antibodies can be "humanized" by techniques well known in 
the art. See, e.g., U.S. Patent No. 5,225,539. Antibody fragments that contain the idiotypes to an 
FCTRX protein may be produced by techniques known in the art including, but not limited to: (/) 
an F(ab')2 fragment produced by pepsin digestion of an antibody molecule; (//) an Fab fragment 
generated by reducing the disulfide bridges of an F(ab')2 fragment; (///) an Fab fragment generated 
by the treatment of the antibody molecule with papain and a reducing agent; and (/v) Fy 
fragments. 

Additionally, recombinant anti-FCTRX antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be made 
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using standard recombi]^|t DNA techniques, are within the scoj^^ the invention. Such 
chimeric and humaniz«BR)noclonal antibodies can be produce^^ recombinant DNA 
techniques known in the art, for example using methods described in International Application 
No. PCTAJS86/02269; European Patent Application No. 184,187; European Patent Application 
5 No. 171,496; European Patent Application No. 173,494; PCT International Publication No. WO 
86/01533; U.S. Patent No. 4,816,567; U.S. Pat. No. 5,225,539; European Patent Application No. 
125,023; Better, et al, 1988. Science 240: 1041-1043; Liu, et al., 1987. Proc. Natl. Acad. Set 
USA 84: 3439-3443; Liu, et al, 1987. J. Immunol. 139: 3521-3526; Sun, et al, 1987. Proc. Natl 
Acad. Sci. USA 84: 214-218; Nishimura, et al, 1987. Cancer Res. 47: 999-1005; Wood, et al, 
10 1985. Nature 3\4 :446-449; Shaw, et al, 1988. J. Natl Cancer Inst. 80: 1553-1559); 

Morrison(1985) Science 229:1202-1207; Oi, et al (1986) BioTechniques 4:214; Jones, et al, 
1986. Nature 321 : 552-525; Verhoeyan, et al, 1988. Science 239: 1534; and Beidler, et al, 
1988. J. Immunol. 141: 4053-4060. Each of the above citations are incorporated herein by 
reference in their entirety. 
iEi 15 In one embodiment, methods for the screening of antibodies that possess the desired 

CI specificity include, but are not limited to, enzyme-linked immunosorbent assay (ELISA) and 

'T[ other immunologically-mediated techniques known within the art. In a specific embodiment, 

selection of antibodies that are specific to a particular domain of an FCTRX protein is facilitated 
5 by generation of hybridomas that bind to the fragment of an FCTRX protein possessing such a 

J^l 20 domain. Thus, antibodies that are specific for a desired domain within an FCTRX protein, or 

Ul 

CzJ derivatives, fragments, analogs or homologs thereof, are also provided herein. 

in 

h% Anti-FCTRX antibodies may be used in methods knovra within the art relating to the 

^'"^ localization and/or quantitation of an FCTRX protein (e.g., fox use in measuring levels of the 

FCTRX protein v^thin appropriate physiological samples, for use in diagnostic methods, for use 
25 in imaging the protein, and the like). In a given embodiment, antibodies for FCTRX proteins, or 
derivatives, fragments, analogs or homologs thereof, that contain the antibody derived binding 
domain, are utilized as pharmacologically-active compounds (hereinafter "Therapeutics"). 

An anti-FCTRX antibody {e.g., monoclonal antibody) can be used to isolate an FCTRX 
polypeptide by standard techniques, such as affinity chromatography or immunoprecipitation. 
30 An anti-FCTRX antibody can facilitate the purification of natural FCTRX polypeptide from cells 
and of recombinantly-produced FCTRX polypepfide expressed in host cells. Moreover, an 
anti-FCTRX antibody can be used to detect FCTRX protein (e.g., in a cellular lysate or cell 
supernatant) in order to evaluate the abundance and pattern of expression of the FCTRX protein. 
Anti-FCTRX antibodies can be used diagnostically to monitor protein levels in tissue as part of a 
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clinical testing proceduj^^.g., to, for example, determine the e£|^cy of a given treatment 
regimen. Detection caWr facilitated by coupling (/.e., physicall^nnking) the antibody to a 
detectable substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 
p-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes 
luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and 
examples of suitable radioactive material include '^^I, "^^S or ^H. 

FCTRX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an FCTRX protein, or derivatives, fragments, analogs or 
homologs thereof As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", 
which refers to a circular double stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be 
ligated into the viral genome. Certain vectors are capable of autonomous replication in a host 
cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication 
and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively-linked. Such vectors are referred to herein as 
"expression vectors". In general, expression vectors of utility in recombinant DNA techniques 
are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be 
used interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression vectors, such as viral vectors 
(e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve 
equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that the 
recombinant expression vectors include one or more regulatory sequences, selected on the basis 
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of the host cells to be 




for expression, that is operatively-li] 




lo the nucleic acid sequence 



to be expressed. Withi^Recombinant expression vector, "operamy-Iinked" is intended to mean 
that the nucleotide sequence of interest is Hnked to the regulatory sequence(s) in a manner that 
allows for expression of the nucleotide sequence {e.g., in an in vitro transcription/translation 
system or in a host cell when the vector is introduced into the host cell). 

The term "regulatory sequence" is intended to includes promoters, enhancers and other 
expression control elements (e.g., polyadenylation signals). Such regulatory sequences are 
described, for example, in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, Calif (1990). Regulatory sequences include 
those that direct constitutive expression of a nucleotide sequence in many types of host cell and 
those that direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the 
invention can be introduced into host cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., FCTRX proteins, 
mutant forms of FCTRX proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
FCTRX proteins in prokaryotic or eukaryotic cells. For example, FCTRX proteins can be 
expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression 
vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
Calif (1990). Alternatively, the recombinant expression vector can be transcribed and translated 
in vitro ^ for example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in Escherichia coli with 
vectors containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, 
usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve 
three purposes: (/) to increase expression of recombinant protein; (//) to increase the solubility of 
the recombinant protein; and (///) to aid in the purification of the recombinant protein by acting 
as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage 
site is introduced at the jimction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to purification of the 
fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, 
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thrombin and enterokin^^ Typical fusion expression vectors io^de pGEX (Pharmacia 





Biotech Inc; Smith ancWiinson, 1988. Gene 67: 31-40), pMAL (Wew England Biolabs, Beverly, 



Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 



5 



Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 



(Amrann et al, (1988) Gene 69:301-315) and pET lid (Studier et a/.. Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 
60-89). 

One strategy to maximize recombinant protein expression in E, coli is to express the 

10 protein in a host bacteria v^th an impaired capacity to proteolytically cleave the recombinant 

protein. See, e.g., Gottesman, Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, Calif. (1990) 1 19-128. Another strategy is to alter the nucleic acid 
sequence of the nucleic acid to be inserted into an expression vector so that the individual codons 
for each amino acid are those preferentially utilized in E. coli {see, e.g,, Wada, et aL, 1992. Nucl 

15 Acids Res, 20: 2111-211 8). Such alteration of nucleic acid sequences of the invention can be 
carried out by standard DNA synthesis techniques. 

In another embodiment, the FCTRX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSecl 
(Baldari, et a/., 1987. EMBOJ. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 30: 

20 933-943), pJRY88 (Schultz et aL, 1987. Gene 54: 1 13-123), pYES2 (Invitrogen Corporation, 
San Diego, Calif), and picZ (InVitrogen Corp, San Diego, Calif). 

Alternatively, FCTRX can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., 
SF9 cells) include the pAc series (Smith, et a/., 1983. Mol Cell Biol 3: 2156-2165) and the pVL 

25 series (Lucklowand Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vector. Examples of mammalian expression vectors include 
pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et al, 1987. EMBOl 6: 
187-195). When used in mammalian cells, the expression vector's control functions are often 

30 provided by viral regulatory elements. For example, commonly used promoters are derived from 
polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable expression 
systems for both prokaryotic and eukaryotic cells see, e.g,. Chapters 16 and 17 of Sambrook, et 
al. Molecular Cloning: A Laboratory Manual. 2nd ed.. Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 
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In another emb^inent, the recombinant mammalian exij^sion vector is capable of 
directing expression oHlRiucleic acid preferentially in a partici^F cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
5 promoters include the albumin promoter (liver-specific; Pinkert, et aL, 1987. Genes Dev. 1 : 

268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv, Immunol 43: 235-275), 
in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBOJ. 8: 729-733) 
and immunoglobulins (Banerji, et al, 1983. Cell 33: 729-740; Queen and Baltimore, 1983. Cell 
33: 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle, 

10 1989. Proc. Natl Acad. Scl USA 86: 5473-5477), pancreas-specific promoters (Edlund, et al, 

1985. Science 230: 912-916), and mammary gland-specific promoters {e.g., milk whey promoter; 
U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss, 
1990. Science 249: 374-379) and the a-fetoprotein promoter (Campes and Tilghman, 1989. 

15 Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That is, 
the DNA molecule is operatively-linked to a regulatory sequence in a manner that allows for 
expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to 

20 FCTRX mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the 

antisense orientation can be chosen that direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen that direct constitutive, tissue specific or cell type specific expression of 
antisense RNA. The antisense expression vector can be in the form of a recombinant plasmid, 

25 phagemid or attenuated virus in which antisense nucleic acids are produced imder the control of 
a high efficiency regulatory region, the activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the regulation of gene expression using 
antisense genes see, e.g., Weintraub, et aL, "Antisense RNA as a molecular tool for genetic 
analysis," Reviews-Trends in Genetics, Vol. 1(1) 1986. 

30 Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms "host cell" and "recombinant 
host cell" are used interchangeably herein. It is understood that such terms refer not only to the 
particular subject cell but also to the progeny or potential progeny of such a cell. Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 
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A host cell can be any prokaryotic or eukaryotic cell. For example, FCTRX protein can 
be expressed in bacterial cells such as E, coli, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those 
skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. 
Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al. 
(Molecular Cloning: A Laboratory Manual. 2nd ed.. Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory 
manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may integrate 
the foreign DNA into their genome. In order to identify and select these integrants, a gene that 
encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host 
cells along with the gene of interest. Various selectable markers include those that confer 
resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a 
selectable marker can be introduced into a host cell on the same vector as that encoding FCTRX 
or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic 
acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker 
gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can 
be used to produce (/.e., express) FCTRX protein. Accordingly, the invention further provides 
methods for producing FCTRX protein using the host cells of the invention. In one embodiment, 
the method comprises culturing the host cell of invention (into which a recombinant expression 
vector encoding FCTRX protein has been introduced) in a suitable medium such that FCTRX 
protein is produced. In another embodiment, the method further comprises isolating FCTRX 
protein from the medium or the host cell. 
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Transgenic FCTRX ^^Mnals 

The host cells ot tne invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or an 
embryonic stem cell into v^hich FCTRX protein-coding sequences have been introduced. Such 
5 host cells can then be used to create non-human transgenic animals in which exogenous FCTRX 
sequences have been introduced into their genome or homologous recombinant animals in which 
endogenous FCTRX sequences have been altered. Such animals are useful for studying the 
function and/or activity of FCTRX protein and for identifying and/or evaluating modulators of 
FCTRX protein activity. As used herein, a "transgenic animal" is a non-human animal, 
10 preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more of 
the cells of the animal includes a transgene. Other examples of transgenic animals include 
non-human primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is 
exogenous DNA that is integrated into the genome of a cell from which a transgenic animal 
Q develops and that remains in the genome of the mature animal, thereby directing the expression 

ff5 15 of an encoded gene product in one or more cell types or tissues of the transgenic animal. As 
J=f used herein, a "homologous recombinant animal" is a non-human animal, preferably a mammal, 

Ui: more preferably a mouse, in which an endogenous FCTRX gene has been altered by homologous 

Jji recombination between the endogenous gene and an exogenous DNA molecule introduced into a 

f cell of the animal, e.g., an embryonic cell of the animal, prior to development of the animal. 

iJ 

20 A transgenic animal of the invention can be created by introducing FCTRX-encoding 

nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by microinjection, retroviral 
C~l infection) and allowing the oocyte to develop in a pseudopregnant female foster animal. The 

human FCTRX cDNA sequences of SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 
24, can be introduced as a transgene into the genome of a non-human animal. Alternatively, a 

25 non-human homologue of the human FCTRX gene, such as a mouse FCTRX gene, can be 

isolated based on hybridization to the human FCTRX cDNA (described further supra) and used 
as a transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific regulatory 
sequence(s) can be operably-linked to the FCTRX transgene to direct expression of FCTRX 

30 protein to particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan, 
1986. In: Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N. Y. Similar methods are used for production of other transgenic animals. A 
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transgenic founder anjj^ can be identified based upon the pr^^^e of the FCTRX transgene in 
its genome and/or ex^Kion of FCTRX niRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene-encoding FCTRX protein can further be bred to other ^ 
5 transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains at least 
a portion of an FCTRX gene into which a deletion, addition or substitution has been introduced 
to thereby alter, e.g., functionally disrupt, the FCTRX gene. The FCTRX gene can be a human 
gene (e.g., the cDNA of SEQ ID N0S;1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24), but 

10 more preferably, is a non-human homologue of a human FCTRX gene. For example, a mouse 
homologue of human FCTRX gene of SEQ ID N0S:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, 
and 24, can be used to construct a homologous recombination vector suitable for altering an 
endogenous FCTRX gene in the mouse genome. In one embodiment, the vector is designed such 
that, upon homologous recombination, the endogenous FCTRX gene is functionally disrupted 

15 (/.e., no longer encodes a functional protein; also referred to as a "knock out" vector). 

Altematively, the vector can be designed such that, upon homologous recombination, the 
endogenous FCTRX gene is mutated or otherwise altered but still encodes functional protein 
(e.g., the upstream regulatory region can be altered to thereby alter the expression of the 
endogenous FCTRX protein). In the homologous recombination vector, the altered portion of 

20 the FCTRX gene is flanked at its 5'- and 3*-termini by additional nucleic acid of the FCTRX gene 
to allow for homologous recombination to occur between the exogenous FCTRX gene carried by 
the vector and an endogenous FCTRX gene in an embryonic stem cell. The additional flanking 
FCTRX nucleic acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5'- and 3*-termini) 

25 are included in the vector. iSee, e.g., Thomas, et a/., 1987. Cell 51: 503 for a description of 
homologous recombination vectors. The vector is ten introduced into an embryonic stem cell 
line (e.g., by electroporation) and cells in which the introduced FCTRX gene has homologously- 
recombined with the endogenous FCTRX gene are selected. See, e.g., Li, et al, 1992. Cell 69: 
915. 

30 The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form 

aggregation chimeras. See, e.g., Bradley, 1987. In: Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 1 13-152. A chimeric embryo 
can then be implanted into a suitable pseudopregnant female foster animal and the embryo 
brought to term. Progeny harboring the homologously-recombined DNA in their germ cells can 
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which all cells of the animal contaii 




Lomologously 



-recombined 



DNA by germline transfusion of the transgene. Methods for constructing homologous 
recombination vectors and homologous recombinant animals are described further in Bradley, 
1991. Cwrr. Opin, Biotechnol 2: 823-829; PCT Intemational Publication Nos.: WO 90/1 1354; 
WO 91/01 140; WO 92/0968; and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that contain 
selected systems that allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI . For a description of the cre/loxP 
recombinase system. See, e.g., Lakso, etal, 1992. Proc, Natl Acad, Set USA 89: 6232-6236. 
Another example of a recombinase system is the FLP recombinase system of Saccharomyces 
cerevisiae. See, O'Gorman, et al,, 1991. 5c/ertce 251:1351-1355. If a cre/loxP recombinase 
system is used to regulate expression of the transgene, animals containing transgenes encoding 
both the Cre recombinase and a selected protein are required. Such animals can be provided 



through the construction of "double" transgenic animals, e.g., by mating two transgenic animals, 
one containing a transgene encoding a selected protein and the other containing a transgene 
encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et al,, 1997. Nature 385: 810-813. In brief, a cell 
(e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit the growth 
cycle and enter Go phase. The quiescent cell can then be fiised, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the same species from which the quiescent cell 
is isolated. The reconstructed oocyte is then cultured such that it develops to morula or 
blastocyte and then transferred to pseudopregnant female foster animal. The offspring borne of 
this female foster animal will be a clone of the animal from which the cell (e.g., the somatic cell) 
is isolated. 

Pharmaceutical Compositions 

The FCTRX nucleic acid molecules, FCTRX proteins, and anti-FCTRX antibodies (also 
referred to herein as "active compounds") of the invention, and derivatives, fragments, analogs 
and homologs thereof, can be incorporated into pharmaceutical compositions suitable for 
administration. Such compositions typically comprise the nucleic acid molecule, protein, or 
antibody and a pharmaceutically acceptable carrier. As used herein, "pharmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, 
antibacterial and antifimgal agents, isotonic and absorption delaying agents, and the like, 
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compatible with ph; 




futical administration. Suitable carrii 




:e described in the most 



recent edition of Remin|Rn's Pharmaceutical Sciences, a standara reference text in the field, 
which is incorporated herein by reference. Preferred examples of such carriers or diluents 
include, but are not limited to, water, saline, finger's solutions, dextrose solution, and 5% human 
serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be used. The 
use of such media and agents for pharmaceutically active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible v^th the active compound, 
use thereof in the compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with its 
intended route of administration. Examples of routes of administration include parenteral, e,g., 
intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (/.e., topical), 
transmucosal, and rectal administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a sterile diluent 
such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene 
glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl 
parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, and 
agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be 
adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of 
glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of 
sterile injectable solutions or dispersion. For intravenous administration, suitable carriers 

TM 

include physiological saline, bacteriostatic water, Cremophor EL (BASF, Parsippany, N.J.) or 
phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be 
fluid to the extent that easy syringeability exists. It must be stable under the conditions of 
manufacture and storage and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and 
liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can 
be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the 
required particle size in the case of dispersion and by the use of surfactants. Prevention of the 
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action of microorganisn^^an be achieved by various antibacter^fcnd antifungal agents, for 
example, parabens, chlSWutanol, phenol, ascorbic acid, thimero^ and the like. In many 
cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as 
manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the composition an agent which delays 
absorption, for example, alimiinum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound (e.g., 
an FCTRX protein or anti-FCTRX antibody) in the required amount in an appropriate solvent 
v^th one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a 
sterile vehicle that contains a basic dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of 
the active ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
administration, the active compound can be incorporated with excipients and used in the form of 
tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use 
as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and 
expectorated or swallowed. Pharmaceutically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, troches and the 
like can contain any of the following ingredients, or compounds of a similar nature: a binder 
such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a lubricant such as 
magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent 
such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl salicylate, or 
orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an aerosol 
spray from pressured container or dispenser which contains a suitable propellant, e.g., a gas such 
as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated 
are used in the formulation. Such penetrants are generally known in the art, and include, for 
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example, for transmuc^^administration, detergents, bile salts^Bd fusidic acid derivatives. 
Transmucosal adminis«ron can be accomplished through the i^^f nasal sprays or 
suppositories. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in the art. 

The compounds can also be prepared in the form of suppositories (e.g., with conventional 
suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal 
delivery. 

In one embodiment, the active compounds are prepared with carriers that v^ll protect the 
compound against rapid elimination from the body, such as a controlled release formulation, 
including implants and microencapsulated delivery systems. Biodegradable, biocompatible 
polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, poiyorthoesters, and polylactic acid. Methods for preparation of such formulations will 
be apparent to those skilled in the art. The materials can also be obtained conmiercially from 
Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes 
targeted to infected cells with monoclonal antibodies to viral antigens) can also be used as 
pharmaceutically acceptable carriers. These can be prepared according to methods known to 
those skilled in the art, for example, as described in U.S. Patent No. 4,522,8 11. 

It is especially advantageous to formulate oral or parenteral compositions in dosage unit 
form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers 
to physically discrete imits suited as unitary dosages for the subject to be treated; each unit 
containing a predetermined quantity of active compound calculated to produce the desired 
therapeutic effect in association with the required pharmaceutical carrier. The specification for 
the dosage unit forms of the invention are dictated by and directly dependent on the unique 
characteristics of the active compound and the particular therapeutic effect to be achieved, and 
the limitations inherent in the art of compounding such an active compound for the treatment of 
individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used as gene 
therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, intravenous 
injection, local administration (see, e.g., U.S. Patent No. 5,328,470) or by stereotactic injection 
{see, e.g., Chen, et al, 1994. Proc. Natl Acad. ScL USA 91: 3054-3057). The pharmaceutical 
preparation of the gene therapy vector can include the gene therapy vector in an acceptable 
diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector can be produced intact from recombinant 
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cells, e.g., retroviral vi 




•s, the pharmaceutical preparation c; 




ude one or more cells that 



produce the gene delive^system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 

Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express FCTRX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy applications), to 
detect FCTRX mRNA (e.g., in a biological sample) or a genetic lesion in an FCTRX gene, and 
to modulate FCTRX activity, as described further, below. In addition, the FCTRX proteins can 
be used to screen drugs or compounds that modulate the FCTRX protein activity or expression as 
well as to treat disorders characterized by insufficient or excessive production of FCTRX protein 
or production of FCTRX protein forms that have decreased or aberrant activity compared to 
FCTRX v^ld-type protein (e.g.; diabetes (regulates insulin release); obesity (binds and transport 
lipids); metabolic disturbances associated with obesity, the metabolic syndrome X as well as 
anorexia and wasting disorders associated with chronic diseases and various cancers, and 
infectious disease(possesses anti-microbial activity) and the various dyslipidemias. In addition, 
the anti-FCTRX antibodies of the invention can be used to detect and isolate FCTRX proteins 
and modulate FCTRX activity. In yet a further aspect, the invention can be used in methods to 
influence appetite, absorption of nutrients and the disposition of metabolic substrates in both a 
positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 

Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) that bind to FCTRX proteins or have a 
stimulatory or inhibitory effect on, e.g., FCTRX protein expression or FCTRX protein activity. 
The invention also includes compounds identified in the screening assays described herein. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of an FCTRX 
protein or polypeptide or biologically-active portion thereof The test compounds of the 
invention can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; spatially addressable parallel solid 
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phase or solution phas< 




■aries; synthetic library methods 



reqi 




deconvolution; the 



"one-bead one-corapouHr library method; and synthetic library methods using affinity 
chromatography selection. The biological library approach is limited to peptide libraries, while 
the other four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds. See, e.g., Lam, \991 . Anticancer Drug Design 12: 145. 

A "small molecule" as used herein, is meant to refer to a composition that has a molecular 
weight of less than about 5 kD and most preferably less than about 4 kD. Small molecules can 
be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, lipids or other 
organic or inorganic molecules. Libraries of chemical and/or biological mixtures, such as fungal, 
bacterial, or algal extracts, are known in the art and can be screened with any of the assays of the 
invention. 

Examples of methods for the synthesis of molecular libraries can be found in the art, for 
example in: DeWitt, et a/., 1993. Proc. Natl Acad. Sci. U.S.A. 90: 6909; Erb, et al, 1994. Proc. 
Natl Acad. Sci. U.S.A. 91: 11422; Zuckermann, etal,, 1994. J. Med Chem. 37: 2678; Cho, et al, 
1993. Science 261 : 1303; Carrell, et a/., 1994. Angew. Chem. Int. Ed. Engl 33: 2059; Carell, et 
al, 1994. Angew. Chem. Int. Ed. Engl 33: 2061; and Gallop, et al, 1994. J. Med Chem. 37: 
1233. 

Libraries of compounds may be presented in solution {e.g., Houghten, 1992. 
Biotechniques 13: 412-421), or on beads (Lam, \99\. Nature 354: 82-84), on chips (Fodor, 1993. 
Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladner, U.S. Patent 
5,233,409), plasmids (Cull, et al, 1992. Proc. Natl Acad Sci. USA 89: 1865-1869) or on phage 
(Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. Science 249: 404-406; Cwiria, et 
al, 1990. Proc. Natl Acad Set U.S.A. 87: 6378-6382; Felici, 1991. J. Mol Biol 222: 301-310; 
Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a 
membrane-bound form of FCTRX protein, or a biologically-active portion thereof, on the cell 
surface is contacted with a test compound and the ability of the test compound to bind to an 
FCTRX protein determined. The cell, for example, can of mammalian origin or a yeast cell. 
Determining the ability of the test compound to bind to the FCTRX protein can be accomplished, 
for example, by coupling the test compound with a radioisotope or enzymatic label such that 
binding of the test compound to the FCTRX protein or biologically-active portion thereof can be 
determined by detecting the labeled compound in a complex. For example, test compounds can 
be labeled with ^^^I, ^^S, ^"^C, or ^H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test compounds can 
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be enzymatically-labek^|dth, for example, horseradish peroxi^^^ alkaline phosphatase, or 
luciferase, and the en2^Btic label detected by determination of conversion of an appropriate 
substrate to product. In one embodiment, the assay comprises contacting a cell which expresses 
a membrane-bound form of FCTRX protein, or a biologically-active portion thereof, on the cell 
surface with a known compound which binds FCTRX to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to interact 
with an FCTRX protein, wherein determining the ability of the test compound to interact v^th an 
FCTRX protein comprises determining the ability of the test compound to preferentially bind to 
FCTRX protein or a biologically-active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of FCTRX protein, or a biologically-active portion thereof, 
on the cell surface v^th a test compound and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the FCTRX protein or biologically-active 
portion thereof. Determining the ability of the test compound to modulate the activity of FCTRX 
or a biologically-active portion thereof can be accomplished, for example, by determining the 
ability of the FCTRX protein to bind to or interact with an FCTRX target molecule. As used 
herein, a "target molecule" is a molecule with which an FCTRX protein binds or interacts in 
nature, for example, a molecule on the surface of a cell which expresses an FCTRX interacting 
protein, a molecule on the surface of a second cell, a molecule in the extracellular milieu, a 
molecule associated v^th the internal surface of a cell membrane or a cytoplasmic molecule. An 
FCTRX target molecule can be a non-FCTRX molecule or an FCTRX protein or polypeptide of 
the invention. In one embodiment, an FCTRX target molecule is a component of a signal 
transduction pathway that facilitates transduction of an extracellular signal {e.g, a signal 
generated by binding of a compound to a membrane-bound FCTRX molecule) through the cell 
membrane and into the cell. The target, for example, can be a second intercellular protein that 
has catalytic activity or a protein that facilitates the association of downstream signaling 
molecules with FCTRX. 

Determining the ability of the FCTRX protein to bind to or interact with an FCTRX 
target molecule can be accomplished by one of the methods described above for determining 
direct binding. In one embodiment, determining the ability of the FCTRX protein to bind to or 
interact with an FCTRX target molecule can be accomplished by determining the activity of the 
target molecule. For example, the activity of the target molecule can be determined by detecting 
induction of a cellular second messenger of the target {Le. intracellular Ca^^, diacylglycerol, IP3, 
etc.), detecting catalytic/enzymatic activity of the target an appropriate substrate, detecting the 
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induction of a reporter 




(comprising an FCTRX-responsive. 



[latory element operatively 



linked to a nucleic acid^lrcbding a detectable marker, e.g,, luciferase), or detecting a cellular 
response, for example, cell survival, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay comprising 
contacting an FCTRX protein or biologically-active portion thereof v^th a test compound and 
determining the ability of the test compound to bind to the FCTRX protein or biologically-active 
portion thereof Binding of the test compound to the FCTRX protein can be determined either 
directly or indirectly as described above. In one such embodiment, the assay comprises 
contacting the FCTRX protein or biologically-active portion thereof with a knovm compound 
which binds FCTRX to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact v/ith an FCTRX protein, 
wherein determining the ability of the test compound to interact v^th an FCTRX protein 
comprises determining the ability of the test compound to preferentially bind to FCTRX or 
biologically-active portion thereof as compared to the known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting FCTRX 
protein or biologically-active portion thereof with a test compound and determining the ability of 
the test compound to modulate (e.g, stimulate or inhibit) the activity of the FCTRX protein or 
biologically-active portion thereof. Determining the ability of the test compound to modulate the 
activity of FCTRX can be accomplished, for example, by determining the ability of the FCTRX 
protein to bind to an FCTRX target molecule by one of the methods described above for 
determining direct binding. In an alternative embodiment, determining the ability of the test 
compound to modulate the activity of FCTRX protein can be accomplished by determining the 
ability of the FCTRX protein further modulate an FCTRX target molecule. For example, the 
catalytic/enzymatic activity of the target molecule on an appropriate substrate can be determined 
as described, supra. 

In yet another embodiment, the cell-free assay comprises contacting the FCTRX protein 
or biologically-active portion thereof with a known compound which binds FCTRX protein to 
form an assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact vnth an FCTRX protein, wherein determining the ability 
of the test compound to interact with an FCTRX protein comprises determining the ability of the 
FCTRX protein to preferentially bind to or modulate the activity of an FCTRX target molecule. 

The cell-free assays of the invention are amenable to use of both the soluble form or the 
membrane-bound form of FCTRX protein. In the case of cell-free assays comprising the 
membrane-bound form of FCTRX protein, it may be desirable to utilize a solubilizing agent such 
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that the membrane-bou|^orm of FCTRX protein is maintained^kolution. Examples of such 
solubilizing agents incHw non-ionic detergents such as n-octylgracoside, n-dodecylglucoside, 
n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Tnton 
X-100, Triton® X-1 14, Thesit®, Isotridecypoly(ethylene glycol ether)n, N-dodecyl— 
5 N,N-dimethyl-3-ammomo-l -propane sulfonate, 3-(3-cholamidopropyl) dimethylamminiol- 
1 -propane sulfonate (CHAPS), or 3-(3-cholanudopropyl)dimethylaniminiol-2-hydroxy- 
1 -propane sulfonate (CHAPSO). 

In more than one embodiment of the above assay methods of the invention, it may be 
desirable to immobilize either FCTRX protein or its target molecule to facilitate separation of 
10 complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to FCTRX protein, or interaction of 
FCTRX protein with a target molecule in the presence and absence of a candidate compound, 
can be accomplished in any vessel suitable for containing the reactants. Examples of such 
vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a 
vlI 1 5 fusion protein can be provided that adds a domain that allows one or both of the proteins to be 
f=J bound to a matrix. For example, GST-FCTRX fusion proteins or GST-target fusion proteins can 

1==^ be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 

u| derivatized microtiter plates, that are then combined with the test compound or the test 

fri 

l''^ compound and either the non-adsorbed target protein or FCTRX protein, and the mixture is 

P; 20 incubated under conditions conducive to complex formation {e.g., at physiological conditions for 
n sah and pH). Follov^ng incubation, the beads or microtiter plate wells are washed to remove any 

^ unbound components, the matrix immobilized in the case of beads, complex determined either 

directly or indirectly, for example, as described, supra. Alternatively, the complexes can be 
dissociated from the matrix, and the level of FCTRX protein binding or activity determined 
25 using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the screening 
assays of the invention. For example, either the FCTRX protein or its target molecule can be 
immobilized utilizing conjugation of biotin and streptavidin. Biotinylated FCTRX protein or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques 
30 well-known within the art {e.g., biotinylation kit. Pierce Chemicals, Rockford, 111.), and 

immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, 
antibodies reactive with FCTRX protein or target molecules, but which do not interfere with 
binding of the FCTRX protein to its target molecule, can be derivatized to the wells of the plate, 
and unbound target or FCTRX protein trapped in the wells by antibody conjugation. Methods 
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for detecting such com] 
complexes, include imi 




protein or target molecule, as well as enzyme-linked assays that rely on detecting an enzymatic 
activity associated with the FCTRX protein or target molecule. 

In another embodiment, modulators of FCTRX protein expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of FCTRX 
mRNA or protein in the cell is determined. The level of expression of FCTRX mRNA or protein 
in the presence of the candidate compound is compared to the level of expression of FCTRX 
mRNA or protein in the absence of the candidate compound. The candidate compound can then 
be identified as a modulator of FCTRX mRNA or protein expression based upon this 
comparison. For example, when expression of FCTRX mRNA or protein is greater (i.e., 
statistically significantly greater) in the presence of the candidate compoxmd than in its absence, 
the candidate compound is identified as a stimulator of FCTRX mRNA or protein expression. 
Alternatively, when expression of FCTRX mRNA or protein is less (statistically significantly 
less) in the presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of FCTRX mRNA or protein expression. The level of FCTRX mRNA 
or protein expression in the cells can be determined by methods described herein for detecting 
FCTRX mRNA or protein. 

In yet another aspect of the invention, the FCTRX proteins can be used as "bait proteins" 
in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,3 1 7; Zervos, et aL, 
1993. Cell 72: 223-232; Madura, et al, 1993. J. Biol Chem, 268: 12046-12054; Bartel, et al, 
1993. Biotechniques 14: 920-924; Iwabuchi, etal, 1993. Oncogene 8: 1693-1696; and Brent 
WO 94/10300), to identify other proteins that bind to or interact with FCTRX ("FCTRX-binding 
proteins" or "FCTRX-bp") and modulate FCTRX activity. Such FCTRX-binding proteins are 
also likely to be involved in the propagation of signals by the FCTRX proteins as, for example, 
upstream or downstream elements of the FCTRX pathway. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two 
different DNA constructs. In one construct, the gene that codes for FCTRX is fused to a gene 
encoding the DNA binding domain of a known transcription factor {e.g., GAL-4). In the other 
construct, a DNA sequence, from a library of DNA sequences, that encodes an unidentified 
protein ("prey" or "sample") is fused to a gene that codes for the activation domain of the known 
transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, forming an 
FCTRX-dependent complex, the DNA-binding and activation domains of the transcription factor 
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tfi of a reporter gene (e.g., 
►onsive to the transcription 



factor. Expression of the reporter gene can be detected and cell colonies containing the 



protein which interacts with FCTRX. 

The invention further pertains to novel agents identified by the aforementioned screening 
assays and uses thereof for treatments as described herein. 

Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the corresponding 
complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way of 
example, and not of limitation, these sequences can be used to: (/) map their respective genes on 
a chromosome; and, thus, locate gene regions associated with genetic disease; (//) identify an 
individual from a minute biological sample (tissue typing); and (///) aid in forensic identification 
of a biological sample. Some of these applications are described in the subsections, below. 

Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is called 
chromosome mapping. Accordingly, portions or fragments of the FCTRX sequences, SEQ ID 
N0S:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, or fragments or derivatives thereof, can 
be used to map the location of the FCTRX genes, respectively, on a chromosome. The mapping 
of the FCTRX sequences to chromosomes is an important first step in correlating these 
sequences with genes associated with disease. 

Briefly, FCTRX genes can be mapped to chromosomes by preparing PGR primers 
(preferably 1 5-25 bp in length) from the FCTRX sequences. Computer analysis of the FCTRX, 
sequences can be used to rapidly select primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. These primers can then be used for 
PCR screening of somatic cell hybrids containing individual human chromosomes. Only those 
hybrids containing the human gene corresponding to the FCTRX sequences will yield an 
amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic cells from different mammals (e.g., 
human and mouse cells). As hybrids of human and mouse cells grow and divide, they gradually 
lose human chromosomes in random order, but retain the mouse chromosomes. By using media 
in which mouse cells cannot grow, because they lack a particular enzyme, but in which human 



functional transcription factor can be isolated and used to obtain the cloned gene that encodes the 
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cells can, the one humaj^hromosome that contains the gene en^^big the needed enzyme will 
be retained. By using ^Rus media, panels of hybrid cell lines ran be established. Each cell 
line in a panel contains either a single human chromosome or a small number of human 
chromosomes, and a full set of mouse chromosomes, allowing easy mapping of individual genes 
to specific human chromosomes. See, e.g,, D'Eustachio, et al., 1983. Science 220: 919-924. 
Somatic cell hybrids containing only fragments of human chromosomes can also be produced by 
using human chromosomes with translocations and deletions. 

PGR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day using a 
single thermal cycler. Using the FCTRX sequences to design oligonucleotide primers, sub- 
localization can be achieved with panels of fragments from specific chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can fiirther be used to provide a precise chromosomal location in one step. 
Chromosome spreads can be made using cells whose division has been blocked in metaphase by 
a chemical like colcemid that disrupts the mitotic spindle. The chromosomes can be treated 
briefly with trypsin, and then stained with Giemsa. A pattern of light and dark bands develops 
on each chromosome, so that the chromosomes can be identified individually. The FISH 
technique can be used with a DNA sequence as short as 500 or 600 bases. However, clones 
larger than 1,000 bases have a higher likelihood of binding to a unique chromosomal location 
with sufficient signal intensity for simple detection. Preferably 1,000 bases, and more preferably 
2,000 bases, will suffice to get good results at a reasonable amount of time. For a review of this 
technique, see, Verma, et al. Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for marking 
multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding regions of 
the genes actually are preferred for mapping purposes. Coding sequences are more likely to be 
conserved within gene families, thus increasing the chance of cross hybridizations during 
chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 
position of the sequence on the chromosome can be correlated with genetic map data. Such data 
are found, e.g., in McKusick, Mendelian INHERITANCE IN Man, available on-line through Johns 
Hopkins University Welch Medical Library). The relationship between genes and disease, 
mapped to the same chromosomal region, can then be identified through linkage analysis 
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(co-inheritance of phys^^ly adjacent genes), described in, e,g,,Mmg\md, et al, 1987. Nature, 
325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected v^th a disease associated with the FCTRX gene, can be determined. If a mutation is 
observed in some or all of the affected individuals but not in any unaffected individuals, then the 
mutation is likely to be the causative agent of the particular disease. Comparison of affected and 
unaffected individuals generally involves first looking for structural alterations in the 
chromosomes, such as deletions or translocations that are visible from chromosome spreads or 
detectable using PGR based on that DNA sequence. Ultimately, complete sequencing of genes 
from several individuals can be performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 

Tissue Typing 

The FCTRX sequences of the invention can also be used to identify individuals from 
minute biological samples. In this technique, an individual's genomic DNA is digested with one 
or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length polymorphisms," described in U.S. Patent No. 5,272,057). 

Furthermore, the sequences of the invention can be used to provide an alternative 
technique that determines the actual base-by-base DNA sequence of selected portions of an 
individual's genome. Thus, the FCTRX sequences described herein can be used to prepare two 
PCR primers from the 5'- and 3 '-termini of the sequences. These primers can then be used to 
amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this manner, can 
provide unique individual identifications, as each individual will have a unique set of such DNA 
sequences due to allelic differences. The sequences of the invention can be used to obtain such 
identification sequences from individuals and from tissue. The FCTRX sequences of the 
invention uniquely represent portions of the human genome. Allelic variation occurs to some 
degree in the coding regions of these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between individual humans occurs with a frequency 
of about once per each 500 bases. Much of the allelic variation is due to single nucleotide 
polymorphisms (SNPs), which include restriction fragment length polymorphisms (RFLPs). 

Each of the sequences described herein can, to some degree, be used as a standard against 
which DNA from an individual can be compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, fewer sequences are necessary to 
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provide positive individual 
ield a noncoding amplified 



sequence of 100 bases. If predicted coding sequences, such as those in SEQ ID NOS: 1 , 3, 5, 7, 
9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, are used, a more appropriate number of primers for 
positive individual identification would be 500-2,000. 
Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics, and monitoring clinical trials are used for prognostic 
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one aspect of 
the invention relates to diagnostic assays for determining FCTRX protein and/or nucleic acid 
expression as well as FCTRX activity, in the context of a biological sample (e.g., blood, serum, 
cells, tissue) to thereby determine whether an individual is afflicted with a disease or disorder, or 
is at risk of developing a disorder, associated with aberrant FCTRX expression or activity. The 
disorders include Also v^thin the scope of the invention is the use of a Therapeutic in the 
manufacture of a medicament for treating or preventing disorders or syndromes including, e.g., 
Colorectal cancer, adenomatous polyposis coli, myelogenous leukemia, congenital ceonatal 
alloimmune thrombocytopenia, multiple human solid malignancies, malignant ovarian tumours 
particularly at the interface between epithelia and stroma, malignant brain tumors, mammary 
tumors, human gliomas, astrocytomas, mixed glioma/astrocytomas, renal cells carcinoma, breast 
adenocarcinoma, ovarian cancer, melanomas, renal cell carcinoma , clear cell and granular cell 
carcinomas, autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine 
stimulation of tumor cell survival and tumor cell resistance to cytotoxic therapy, paranechmal 
and basement membrane invasion and motility of tumor cells thereby contributing to metastasis, 
tumor-mediated immunosuppression of T-cell mediated immune effector cells and pathways 
resulting in timior escape fi*om immune surveilance, neurological disorders, neurodegenerative 
disorders, nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, 
demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions, 
immunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type 
eosinophilia, lung diseases, reproductive disorders, male infertility, female reproductive system 
disorders, male and female reproductive diseases, hemangioma, deafness, glycoprotein la 
deficiency, desmoid disease, turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders, 
pancreatic diseases like diabetes. Schistosoma mansoni infection. Spinocerebellar ataxia, 
Plasmodium falciparum parasitemia. Corneal dystrophy -Groenouw type I, Corneal dystrophy - 
lattice type I, and Reis-Bucklers corneal dystrophy. The invention also provides for prognostic 
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(or predictive) assays Ajj^etermining whether an individual is^^sk of developing a disorder 
associated with FCTI^^rotein, nucleic acid expression or activity. For example, mutations in 
an FCTRX gene can be assayed in a biological sample. Such assays can be used for prognostic 
or predictive purpose to thereby prophylactically treat an individual prior to the onset of a 
5 disorder characterized by or associated with FCTRX protein, nucleic acid expression, or 
biological activity. 

Another aspect of the invention provides methods for determining FCTRX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
10 Pharmacogenomics allows for the selection of agents (e,g., drugs) for therapeutic or prophylactic 
treatment of an individual based on the genotype of the individual (e.g., the genotype of the 
individual examined to determine the ability of the individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of FCTRX in clinical trials. 

ul 15 These and other agents are described in further detail in the following sections. 

fri 

III Diagnostic Assays 

i'J An exemplary method for detecting the presence or absence of FCTRX in a biological 

^ sample involves obtaining a biological sample from a test subject and contacting the biological 

sample with a compound or an agent capable of detecting FCTRX protein or nucleic acid {e.g., 
CI 20 mRNA, genomic DNA) that encodes FCTRX protein such that the presence of FCTRX is 
f=| detected in the biological sample. An agent for detecting FCTRX mRNA or genomic DNA is a 

^ labeled nucleic acid probe capable of hybridizing to FCTRX mRNA or genomic DNA. The 

nucleic acid probe can be, for example, a full-length FCTRX nucleic acid, such as the nucleic 
acid of SEQ ID N0S:1, 3, 5, 7, 9, 10, 1 1, 12, 14, 16, 18, 20, 22, and 24, or a portion thereof, such 
25 as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient 
to specifically hybridize under stringent conditions to FCTRX mRNA or genomic DNA. Other 
suitable probes for use in the diagnostic assays of the invention are described herein. 

An agent for detecting FCTRX protein is an antibody capable of binding to FCTRX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 
30 preferably, monoclonal. An intact antibody, or a fragment thereof (e.g.. Fab or F(ab')2) can be 
used. The term "labeled", with regard to the probe or antibody, is intended to encompass direct 
labeling of the probe or antibody by coupling (/.e., physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with 
another reagent that is directly labeled. Examples of indirect labeling include detection of a 
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iuorescently-labeled secondary antib< 




id end-labeling of a DNA 



probe with biotin such iHn it can be detected with fluorescently-Iabeled streptavidin. The term 
"biological sample" is intended to include tissues, cells and biological fluids isolated from a 
subject, as well as tissues, cells and fluids present within a subject. That is, the detection method 
of the invention can be used to detect FCTRX mRNA, protein, or genomic DNA in a biological 
sample in vitro as well as in vivo. For example, in vitro techniques for detection of FCTRX 
mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques for 
detection of FCTRX protein include enzyme linked immunosorbent assays (ELISAs), Westem 
blots, inmiunoprecipitations, and inmiunofluorescence. In vitro techniques for detection of 
FCTRX genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for 
detection of FCTRX protein include introducing into a subject a labeled anti-FCTRX antibody. 
For example, the antibody can be labeled with a radioactive marker whose presence and location 
in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test subject 
or genomic DNA molecules from the test subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods fiirther involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent capable 
of detecting FCTRX protein, mRNA, or genomic DNA, such that the presence of FCTRX 
protein, mRNA or genomic DNA is detected in the biological sample, and comparing the 
presence of FCTRX protein, mRNA or genomic DNA in the control sample with the presence of 
FCTRX protein, mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of FCTRX in a biological 
sample. For example, the kit can comprise: a labeled compound or agent capable of detecting 
FCTRX protein or mRNA in a biological sample; means for determining the amount of FCTRX 
in the sample; and means for comparing the amount of FCTRX in the sample v^th a standard. 
The compound or agent can be packaged in a suitable container. The kit can fiirther comprise 
instructions for using the kit to detect FCTRX protein or nucleic acid. 
Prognostic Assays 

The diagnostic methods described herein can fiorthermore be utilized to identify subjects 
having or at risk of developing a disease or disorder associated with aberrant FCTRX expression 
or activity. For example, the assays described herein, such as the preceding diagnostic assays or 
the following assays, can be utilized to identify a subject having or at risk of developing a 
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disorder associated witi|*r;TRX protein, nucleic acid expressic^^^activity. Alternatively, the 
prognostic assays can ^Hnilized to identify a subject having or at risk for developing a disease 
or disorder. Thus, the invention provides a method for identifying a disease or disorder 
associated with aberrant FCTRX expression or activity in which a test sample is obtained from a 
5 subject and FCTRX protein or nucleic acid (e.g., mRNA, genomic DNA) is detected, wherein 
the presence of FCTRX protein or nucleic acid is diagnostic for a subject having or at risk of 
developing a disease or disorder associated with aberrant FCTRX expression or activity. As 
used herein, a "test sample" refers to a biological sample obtained fi:om a subject of interest. For 
example, a test sample can be a biological fluid (e.g., serum), cell sample, or tissue. 
10 Furthermore, the prognostic assays described herein can be used to determine whether a 

subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 
associated with aberrant FCTRX expression or activity. For example, such methods can be used 
to determine whether a subject can be effectively treated with an agent for a disorder. Thus, the 
15 invention provides methods for determining whether a subject can be effectively treated with an 
f=^ agent for a disorder associated with aberrant FCTRX expression or activity in which a test 

sample is obtained and FCTRX protein or nucleic acid is detected (e.g., wherein the presence of 
FCTRX protein or nucleic acid is diagnostic for a subject that can be administered the agent to 
treat a disorder associated with aberrant FCTRX expression or activity). 
20 The methods of the invention can also be used to detect genetic lesions in an FCTRX 

gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a 
genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
25 encoding an FCTRX-protein, or the misexpression of the FCTRX gene. For example, such 

genetic lesions can be detected by ascertaining the existence of at least one of: (/) a deletion of 
one or more nucleotides from an FCTRX gene; (//) an addition of one or more nucleotides to an 
FCTRX gene; (///) a substitution of one or more nucleotides of an FCTRX gene, (/v) a 
chromosomal rearrangement of an FCTRX gene; (v) an alteration in the level of a messenger 
30 RNA transcript of an FCTRX gene, (v/) aberrant modification of an FCTRX gene, such as of the 
methylation pattern of the genomic DNA, (v//) the presence of a non-wild-type splicing pattern 
of a messenger RNA transcript of an FCTRX gene, (v//7) a non-wild-type level of an FCTRX 
protein, (ix) allelic loss of an FCTRX gene, and (x) inappropriate post-translational modification 
of an FCTRX protein. As described herein, there are a large number of assay techniques known 
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in the art which can beji||^d for detecting lesions in an FCTRX^^. A preferred biological 
sample is a peripheral ^^d leukocyte sample isolated by conventional means from a subject. 
However, any biological sample containing nucleated cells may be used, including, for example, 
buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer in a 
polymerase chain reaction (PCR) {see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such as 
anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g., 
Landegran, et al, 1988. Science 241 : 1077-1080; and Nakazawa, et al, 1994. Proc. Natl Acad 
Sci. USA 91: 360-364), the latter of which can be particularly useful for detecting point 
mutations in the FCTRX-gene {see, Abravaya, et al., 1995. Nucl. Acids Res. 23: 675-682). This 
method can include the steps of collecting a sample of cells from a patient, isolating nucleic acid 
{e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid sample 
v^th one or more primers that specifically hybridize to an FCTRX gene under conditions such 
that hybridization and amplification of the FCTRX gene (if present) occurs, and detecting the 
presence or absence of an amplification product, or detecting the size of the amplification 
product and comparing the length to a control sample. It is anticipated that PCR and/or LCR 
may be desirable to use as a preliminary amplification step in conjunction with any of the 
techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication {see, 
Guatelli, et al, 1990. Proc. Natl Acad. Set USA 87: 1874-1878), transcriptional amplification 
system {see, Kwoh, etal, 1989. Prac. Natl Acad. Set USA 86: 1173-1177); QP Replicase {see, 
Lizardi, et al, 1988. BioTechnology 6: 1 197), or any other nucleic acid amplification method, 
followed by the detection of the amplified molecules using techniques well known to those of 
skill in the art. These detection schemes are especially useful for the detection of nucleic acid 
molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in an FCTRX gene from a sample cell can be 
identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and compared. 
Differences in fragment length sizes between sample and control DNA indicates mutations in the 
sample DNA. Moreover, the use of sequence specific ribozymes {see, e.g., U.S. Patent No. 
5,493,53 1) can be used to score for the presence of specific mutations by development or loss of 
a ribozyme cleavage site. 
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In other emboi 




:nts, genetic mutations in FCTRX c; 



identified by hybridizing a 



sample and control nuSRc acids, e.g., DNA or RNA, to high-density arrays containing hundreds 
or thousands of oligonucleotides probes. See, e.g., Cronin, et al., 1996. Human Mutation 7: 
244-255; Kozal, et al, 1996. Nat, Med. 2: 753-759. For example, genetic mutations in FCTRX 
can be identified in two dimensional arrays containing light-generated DNA probes as described 
in Cronin, et aL, supra. Briefly, a first hybridization array of probes can be used to scan through 
long stretches of DNA in a sample and control to identify base changes between the sequences 
by making linear arrays of sequential overlapping probes. This step allows the identification of 
point mutations. This is followed by a second hybridization array that allows the 
characterization of specific mutations by using smaller, specialized probe arrays complementary 
to all variants or mutations detected. Each mutation array is composed of parallel probe sets, one 
complementary to the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the art can 
be used to directly sequence the FCTRX gene and detect mutations by comparing the sequence 
of the sample FCTRX with the corresponding wild-type (control) sequence. Examples of 
sequencing reactions include those based on techniques developed by Maxim and Gilbert, 1977. 
Proc. Natl Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl. Acad. Sci. USA 74: 5463. It is 
also contemplated that any of a variety of automated sequencing procedures can be utilized when 
performing the diagnostic assays (see, e.g., Naeve, et aL, 1995.. Bio techniques 19: 448), 
including sequencing by mass spectrometry (see, e.g., PCT International Publication No. WO 
94/16101; Cohen, £7/., \996. Adv. Chromatography 36: 127-162; and Griffin, e/ a/., 1993. . 
Appl. Biochem. Biotechnol 38: 147-159). 

Other methods for detecting mutations in the FCTRX gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or RNA/DNA 
heteroduplexes. See, e.g., Myers, et al, 1985. Science 230: 1242. In general, the art technique 
of "mismatch cleavage" starts by providing heteroduplexes of formed by hybridizing (labeled) 
RNA or DNA containing the wild-type FCTRX sequence with potentially mutant RNA or DNA 
obtained from a tissue sample. The double-stranded duplexes are treated with an agent that 
cleaves single-stranded regions of the duplex such as which will exist due to basepair 
mismatches between the control and sample strands. For instance, RNA/DNA duplexes can be 
treated with RNase and DNA/DNA hybrids treated with S| nuclease to enzymatically digesting 
the mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can 
be treated with hydroxylamine or osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched regions, the resulting material is then 
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separated by size on d( 




ing polyacrylamide gels to determi 



e site of mutation. See, e.g.. 



Cotton, et al, 1988. PfWNatl Acad, ScL USA 85: 4397; Saleeba, et aL 1992. Methods 
Enzymol 217: 286-295. In an embodiment, the control DNA or RNA can be labeled for 
detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 
FCTRX cDNAs obtained from samples of cells. For example, the mutY enzyme of E, coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at 
G/T mismatches. See, e.g., Hsu, etal., 1994. Carcinogenesis 15: 1657-1662. According to an 
exemplary embodiment, a probe based on an FCTRX sequence, e.g., a wild-type FCTRX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be detected 
from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to identify 
mutations in FCTRX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids. See, e.g., Orita, etal., 1989. Proc. Natl. Acad Sci. USA: 86: 2766; Cotton, 1993. 
Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. Anal. Tech. Appl. 9: 73-79. Single-stranded 
DNA fragments of sample and control FCTRX nucleic acids v^U be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to sequence, 
the resulting alteration in electrophoretic mobility enables the detection of even a single base 
change. The DNA fragments may be labeled or detected v^th labeled probes. The sensitivity of 
the assay may be enhanced by using RNA (rather than DNA), in which the secondary structure is 
more sensitive to a change in sequence. In one embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility. See, e.g., Keen, et ai, 1991. Trends Genet. 7: 5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient gel 
electrophoresis (DGGE). See, e.g., Myers, etal, 1985. Nature 313: 495. WhenDGGE is used 
as the method of analysis, DNA will be modified to insure that it does not completely denature, 
for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA by 
PCR. In a further embodiment, a temperature gradient is used in place of a denaturing gradient 
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le mobility of control and 



sample Dl 




>ee, e.g., Rosenbaum and 



Reissner, 1987. Bioph^Smhem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not limited 
to, selective oligonucleotide hybridization, selective amplification, or selective primer extension. 
For example, oligonucleotide primers may be prepared in which the known mutation is placed 
centrally and then hybridized to target DNA under conditions that permit hybridization only if a 
perfect match is found. See, e.g., Saiki, et al, 1986. Nature 324: 163; Saiki, et al, 1989. Proc. 
Natl. Acad. ScL USA 86: 6230. Such allele specific oligonucleotides are hybridized to PGR 
amplified target DNA or a number of different mutations when the oligonucleotides are attached 
to the hybridizing membrane and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective PGR 
amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may carry the mutation of interest in the center of the molecule 
(so that amplification depends on differential hybridization; see, e.g., Gibbs, et aL, 1989. Nucl. 
Acids Res. 17: 2437-2448) or at the extreme 3'-terminus of one primer where, under appropriate 
conditions, mismatch can prevent, or reduce polymerase extension (see, e.g., Prossner, 1993. 
Tibtech. 1 1 : 238). In addition it may be desirable to introduce a novel restriction site in the 
region of the mutation to create cleavage-based detection. See, e.g., Gasparini, et al, 1992. Mol 
Cell Probes 6:1. It is anticipated that in certain embodiments amplification may also be 
performed using Taq ligase for amplification. See, e.g., Barany, 1991. Proc. Natl Acad. Sci. 
USA 88: 1 89. In such cases, ligation will occur only if there is a perfect match at the 3'-terminus 
of the 5' sequence, making it possible to detect the presence of a knovra mutation at a specific 
site by looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by utilizing pre-packaged 
diagnostic kits comprising at least one probe nucleic acid or antibody reagent described herein, 
which may be conveniently used, e.g., in clinical settings to diagnose patients exhibiting 
symptoms or family history of a disease or illness involving an FGTRX gene. 

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
FGTRX is expressed may be utilized in the prognostic assays described herein. However, any 
biological sample containing nucleated cells may be used, including, for example, buccal 
mucosal cells. 

Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on FGTRX activity 
{e.g., FGTRX gene expression), as identified by a screening assay described herein can be 
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administered to indivij 




Is to treat 



(prophylactically or therapi 




Jly) disorders (The disorders 



include metabolic dis^lR-s, Also within the scope of the invention is the use of a Therapeutic in 
the manufacture of a medicament for treating or preventing disorders or syndromes including, 
e.g.. Colorectal cancer, adenomatous polyposis coli, myelogenous leukemia, congenital ceonatal 
5 alloimmune thrombocytopenia, multiple himian solid malignancies, malignant ovarian tumours 
particularly at the interface between epithelia and stroma, malignant brain tumors, mammary 
tumors, human gliomas, astrocytomas, mixed glioma/astrocytomas, renal cells carcinoma, breast 
adenocarcinoma, ovarian cancer, melanomas, renal cell carcinoma , clear cell and granular cell 
carcinomas, autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine 
10 stimulation of tumor cell survival and tumor cell resistance to cytotoxic therapy, paranechmal 
and basement membrane invasion and motility of tumor cells thereby contributing to metastasis, 
tumor-mediated immunosuppression of T-cell mediated immune effector cells and pathways 
resulting in tumor escape from immune surveilance, neurological disorders, neurodegenerative 
disorders, nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, 
15 demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions, 
immunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type 
eosinophilia, lung diseases, reproductive disorders, male infertility, female reproductive system 
disorders, male and female reproductive diseases, hemangioma, deafness, glycoprotein la 
deficiency, desmoid disease, turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders, 
20 pancreatic diseases like diabetes. Schistosoma mansoni infection. Spinocerebellar ataxia, 

Plasmodium falciparum parasitemia, Corneal dystrophy -Groenouw type I, Corneal dystrophy - 
lattice type I, and Reis-Bucklers corneal dystrophy) In conjunction with such treatment, the 
pharmacogenomics (/.e., the study of the relationship between an individual's genotype and that 
individual's response to a foreign compound or drug) of the individual may be considered. 
25 Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic failure by 

altering the relation between dose and blood concentration of the pharmacologically active drug. 
Thus, the pharmacogenomics of the individual permits the selection of effective agents (e.g., 
drugs) for prophylactic or therapeutic treatments based on a consideration of the individual's 
genotype. Such pharmacogenomics can further be used to determine appropriate dosages and 
30 therapeutic regimens. Accordingly, the activity of FCTRX protein, expression of FCTRX nucleic 
acid, or mutation content of FCTRX genes in an individual can be determined to thereby select 
appropriate agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals vsdth clinically significant hereditary variations in the response 
to drugs due to altered drug disposition and abnormal action in affected persons. See e.g., 
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Eichelbaum, 1996. Cl^^xp. Pharmacol Physiol, 23: 983-98^jnder, 1997. Clin. Chem., 43: 
254-266. In general, ^^types of pharmacogenetic conditions can be differentiated. Genetic 
conditions transmitted as a single factor altering the way drugs act on the body (altered drug 
action) or genetic conditions transmitted as single factors altering the way the body acts on drugs 
5 (altered drug metabolism). These pharmacogenetic conditions can occur either as rare defects or 
as polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a 
common inherited enzymopathy in which the main clinical complication is hemolysis after 
ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and 
consumption of fava beans. 

10 As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 

determinant of both the intensity and duration of drug action. The discovery of genetic 
polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and 
cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response and 

15 serious toxicity after taking the standard and safe dose of a drug. These polymorphisms are 

m 

ri expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 

metabolizer (PM). The prevalence of PM is different among different populations. For example, 
vP the gene coding for CYP2D6 is highly polymorphic and several mutations have been identified 

in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and 
y'{ 20 CYP2C19 quite frequently experience exaggerated drug response and side effects when they 
fij . receive standard doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic 
p.; response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed 

metabolite morphine. At the other extreme are the so called ultra-rapid metabolizers who do not 
respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been 
25 identified to be due to CYP2D6 gene amplification. 

Thus, the activity of FCTRX protein, expression of FCTRX nucleic acid, or mutation 
content of FCTRX genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic 
studies can be used to apply genotyping of polymorphic alleles encoding drug-metabolizing 
30 enzymes to the identification of an individual's drug responsiveness phenotype. This knowledge, 
when applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and 
thus enhance therapeutic or prophylactic efficiency when treating a subject with an FCTRX 
modulator, such as a modulator identified by one of the exemplary screening assays described 
herein. 
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Monitoring ofEtCects During Clinical Trials 

Monitoring theHmience of agents (e.g., drugs, compounas) on the expression or activity 
of FCTRX (e.g., the ability to modulate aberrant cell proliferation and/or differentiation) can be 
applied not only in basic drug screening, but also in clinical trials. For example, the 
5 effectiveness of an agent determined by a screening assay as described herein to increase 
FCTRX gene expression, protein levels, or upregulate FCTRX activity, can be monitored in 
clinical trails of subjects exhibiting decreased FCTRX gene expression, protein levels, or 
downregulated FCTRX activity. Alternatively, the effectiveness of an agent determined by a 
screening assay to decrease FCTRX gene expression, protein levels, or downregulate FCTRX 
10 activity, can be monitored in clinical trails of subjects exhibiting increased FCTRX gene 

expression, protein levels, or upregulated FCTRX activity. In such clinical trials, the expression 
or activity of FCTRX and, preferably, other genes that have been implicated in, for example, a 
cellular proliferation or immune disorder can be used as a "read out" or markers of the immune 
responsiveness of a particular cell, 
ifl 15 By way of example, and not of limitation, genes, including FCTRX, that are modulated in 

tL cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates 

£=l FCTRX activity (e.g., identified in a screening assay as described herein) can be identified. 

•j\ Thus, to study the effect of agents on cellular proliferation disorders, for example, in a clinical 

^ trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of 

P 20 FCTRX and other genes implicated in the disorder. The levels of gene expression (i.e., a gene 
f=i expression pattem) can be quantified by Northern blot analysis or RT-PCR, as described herein, 

^} or alternatively by measuring the amount of protein produced, by one of the methods as 

1:=:-: described herein, or by measuring the levels of activity of FCTRX or other genes. In this 

manner, the gene expression pattem can serve as a marker, indicative of the physiological 
25 response of the cells to the agent. Accordingly, this response state may be determined before, 
and at various points during, treatment of the individual with the agent. 

In one embodiment, the invention provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide, 
peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the screening 
30 assays described herein) comprising the steps of (/) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (//) detecting the level of expression of an FCTRX 
protein, mRNA, or genomic DNA in the preadministration sample; (///) obtaining one or more 
post-administration samples from the subject; (zv) detecting the level of expression or activity of 
the FCTRX protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
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the level of expressionj|^ctivity of the FCTRX protein, mRN genomic DNA in the 
pre-administration san^^with the FCTRX protein, mRNA, or genomic DNA in the post 
administration sample or samples; and (v/) altering the administration of the agent to the subject 
accordingly. For example, increased administration of the agent may be desirable to increase the 
5 expression or activity of FCTRX to higher levels than detected, /.e., to increase the effectiveness 
of the agent. Alternatively, decreased administration of the agent may be desirable to decrease 
expression or activity of FCTRX to lower levels than detected, i.e., to decrease the effectiveness 
of the agent. 

Methods of Treatment 

10 The invention provides for both prophylactic and therapeutic methods of treating a 

subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
FCTRX expression or activity. The disorders include cardiomyopathy, atherosclerosis, 
hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 
atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
JJf 15 ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, 

fcsr 

□ transplantation, adrenoleukodystrophy, congenital adrenal hyperplasia, prostate cancer, 

neoplasm; adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, hypercoagulation, 
idiopathic thrombocytopenic purpura, immimodeficiencies, graft versus host disease, AIDS, 

r j bronchial asthma, Crohn's disease; multiple sclerosis, treatment of Albright Hereditary 

3 _ 3 

p| 20 Ostoeodystrophy, and other diseases, disorders and conditions of the like. 
In These methods of treatment will be discussed more ftilly, below. 

il Disease and Disorders 

r — 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 

25 Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize 

activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may be 
utilized include, but are not limited to: (/) an aforementioned peptide, or analogs, derivatives, 
fragments or homologs thereof; (//) antibodies to an aforementioned peptide; (///) nucleic acids 
encoding an aforementioned peptide; (/v) administration of antisense nucleic acid and nucleic 

30 acids that are "dysfunctional" {i.e., due to a heterologous insertion within the coding sequences 
of coding sequences to an aforementioned peptide) that are utilized to "knockout" endoggenous 
fimction of an aforementioned peptide by homologous recombination (see, e.g., Capecchi, 1989. 
Science 244: 1288-1292); or (v) modulators ( i.e., inhibitors, agonists and antagonists, including 
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a peptide of the invention) 



that alter the interactiorWetween an aforementioned peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate activity 
may be administered in a therapeutic or prophylactic manner. Therapeutics that may be utilized 
include, but are not limited to, an aforementioned peptide, or analogs, derivatives, fragments or 
homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide and/or RNA, 
by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for RNA or 
peptide levels, structure and/or activity of the expressed peptides (or mRNAs of an 
aforementioned peptide). Methods that are well-known within the art include, but are not limited 
to, immunoassays (e.g., by Western blot analysis, immunoprecipitation followed by sodium 
dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immunocytochemistry, etc.) and/or 
hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in situ 
hybridization, and the like). 

Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant FCTRX expression or activity, by administering to the 
subject an agent that modulates FCTRX expression or at least one FCTRX activity. Subjects at 
risk for a disease that is caused or contributed to by aberrant FCTRX expression or activity can 
be identified by, for example, any or a combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent can occur prior to the manifestation of 
symptoms characteristic of the FCTRX aberrancy, such that a disease or disorder is prevented or, 
alternatively, delayed in its progression. Depending upon the type of FCTRX aberrancy, for 
example, an FCTRX agonist or FCTRX antagonist agent can be used for treating the subject. 
The appropriate agent can be determined based on screening assays described herein. The 
prophylactic methods of the invention are further discussed in the following subsections. 

Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating FCTRX expression or 
activity for therapeutic purposes. The modulatory method of the invention involves contacting a 
cell with an agent that modulates one or more of the activities of FCTRX protein activity 
associated with the cell. An agent that modulates FCTRX protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a naturally-occurring cognate ligand of an 
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FCTRX protein, a pep] 




an 



FCTRX peptidomimetic, or othei 




1 molecule. In one 



embodiment, the agent^mulates one or more FCTRX protein activity. Examples of such 
stimulatory agents include active FCTRX protein and a nucleic acid molecule encoding FCTRX 
that has been introduced into the cell. In another embodiment, the agent inhibits one or more 
FCTRX protein activity. Examples of such inhibitory agents include antisense FCTRX nucleic 
acid molecules and anti-FCTRX antibodies. These modulatory methods can be performed in 
vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering 
the agent to a subject). As such, the invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of an FCTRX 
protein or nucleic acid molecule. In one embodiment, the method involves administering an 
agent (e.g., an agent identified by a screening assay described herein), or combination of agents 
that modulates (e.g., up-regulates or down-regulates) FCTRX expression or activity. In another 
embodiment, the method involves administering an FCTRX protein or nucleic acid molecule as 
therapy to compensate for reduced or aberrant FCTRX expression or activity. 

Stimulation of FCTRX activity is desirable in situations in which FCTRX is abnormally 
downregulated and/or in which increased FCTRX activity is likely to have a beneficial effect. 
One example of such a situation is where a subject has a disorder characterized by aberrant cell 
proliferation and/or differentiation (e.g., cancer or immune associated disorders). Another 
example of such a situation is where the subject has a gestational disease (e.g., preclampsia). 

Determination of the Biological Effect of the Therapeutic 

In various embodiments of the invention, suitable in vitro or in vivo assays are performed 
to determine the effect of a specific Therapeutic and whether its administration is indicated for 
treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic exerts 
the desired effect upon the cell type(s). Compounds for use in therapy may be tested in suitable 
animal model systems including, but not limited to rats, mice, chicken, cows, monkeys, rabbits, 
and the like, prior to testing in human subjects. Similarly, for in vivo testing, any of the animal 
model system known in the art may be used prior to administration to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

The FCTRX nucleic acids and proteins of the invention are useful in potential 
prophylactic and therapeutic applications implicated in a variety of disorders including, but not 
limited to: Also within the scope of the invention is the use of a Therapeutic in the manufacture 
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of a medicament for tr\ 




ig or preventing disorders or syndron^Jbicluding, e.g., Colorectal 




cancer, adenomatous p^posis coli, myelogenous leukemia, congenital ceonatal alloimmune 
thrombocytopenia, multiple human solid malignancies, malignant ovarian tumours particularly at 
the interface between epithelia and stroma, malignant brain tumors, mammary tumors, human 
gliomas, astrocytomas, mixed glioma/astrocytomas, renal cells carcinoma, breast 
adenocarcinoma, ovarian cancer, melanomas, renal cell carcinoma , clear cell and granular cell 
carcinomas, autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine 
stimulation of tumor cell survival and tumor cell resistance to cytotoxic therapy, paranechmal 
and basement membrane invasion and motility of tumor cells thereby contributing to metastasis, 
tumor-mediated immunosuppression of T-cell mediated immune effector cells and pathways 
resulting in tumor escape from immune surveilance, neurological disorders, neurodegenerative 
disorders, nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, 
demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions, 
immunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type 
eosinophilia, limg diseases, reproductive disorders, male infertility, female reproductive system 
disorders, male and female reproductive diseases, hemangioma, deaftiess, glycoprotein la 
deficiency, desmoid disease, turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders, 
pancreatic diseases like diabetes, Schistosoma mansoni infection. Spinocerebellar ataxia, 
Plasmodium falciparum parasitemia. Corneal dystrophy -Groenouw type I, Corneal dystrophy - 
lattice type I, and Reis-Bucklers corneal dystrophy. 

As an example, a cDNA encoding the FCTRX protein of the invention may be useful in 
gene therapy, and the protein may be useful when administered to a subject in need thereof. By 
way of non-limiting example, the compositions of the invention will have efficacy for treatment 
of patients suffering from: Also v^thin the scope of the invention is the use of a Therapeutic in 
the manufacture of a medicament for treating or preventing disorders or syndromes including, 
e.g.. Colorectal cancer, adenomatous polyposis coli, myelogenous leukemia, congenital ceonatal 
alloimmune thrombocytopenia, multiple human solid malignancies, malignant ovarian tumours 
particularly at the interface between epithelia and stroma, malignant brain tumors, mammary 
tumors, human gliomas, astrocytomas, mixed glioma/astrocytomas, renal cells carcinoma, breast 
adenocarcinoma, ovarian cancer, melanomas, renal cell carcinoma , clear cell and granular cell 
carcinomas, autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine 
stimulation of tumor cell survival and tumor cell resistance to cytotoxic therapy, paranechmal 
and basement membrane invasion and motility of tumor cells thereby contributing to metastasis, 
tumor-mediated immunosuppression of T-cell mediated immune effector cells and pathways 
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resulting in tumor esc; 




om immune 



surveilance, neurologic 




lorders, neurodegenerative 



disorders, nerve traum^ronilial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, 
demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions, 
immunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type 
eosinophilia, lung diseases, reproductive disorders, male infertility, female reproductive system 
disorders, male and female reproductive diseases, hemangioma, deafness, glycoprotein la 
deficiency, desmoid disease, turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders, 
pancreatic diseases like diabetes, Schistosoma mansoni infection. Spinocerebellar ataxia, 
Plasmodium falciparum parasitemia. Corneal dystrophy -Groenouv^ type I, Corneal dystrophy - 
lattice type I, and Reis-Bucklers comeal dystrophy. 

Both the novel nucleic acid encoding the FCTRX protein, and the FCTRX protein of the 
invention, or fragments thereof, may also be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. A further use could be 
as an anti-bacterial molecule (i.e,, some peptides have been found to possess anti -bacterial 
properties). These materials are further useful in the generation of antibodies which 
immunospecifically-bind to the novel substances of the invention for use in therapeutic or 
diagnostic methods. 



The following examples illustrate by way of non-limiting example various aspects of the 
invention. 

The following examples illustrate by way of non-limiting example various aspects of the 
invention. 

Example 1: Method of Identifying the Nucleic Acids 

The novel nucleic acids of the invention were identified by TblastN using a proprietary 
sequence file, run against the Genomic Daily Files made available by GenBank. The nucleic 
acids were further predicted by the proprietary software program GenScan"^^, including selection 
of exons. These were further modified by means of similarities using BLAST searches. The 
sequences were then manually corrected for apparent inconsistencies, thereby obtaining the 
sequences encoding the full-length proteins. 



EXAMPLES 
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Example 2. QuaDtita|||||^expression analysis of FCTR2 in v^^is cells and tissues 

The quantitative expression of various clones was assessed using microtiter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines and 
tissues using real time quantitative PGR (RTQ PGR; TAQMAN^. RTQ PGR was performed 
on a Perkin-Elmer Biosystems ABI PRISM® 7700 Sequence Detection System. Various 
collections of samples are assembled on the plates, and referred to as Panel 1 (containing cells 
and cell lines from normal and cancer sources). Panel 2 (containing samples derived from 
tissues, in particular from surgical samples, from normal and cancer sources), Panel 3 
(containing samples derived from a wide variety of cancer sources) and Panel 4 (containing cells 
and cell lines from normal cells and cells related to inflammatory conditions). 

First, the RNA samples were normalized to constitutively expressed genes such as P- 
actin and GAPDH. RNA (-50 ng total or -1 ng polyA+) was converted to cDNA using the 
TAQMAN® Reverse Transcription Reagents Kit (PE Biosystems, Foster Gity, GA; Gatalog No. 
N808-0234) and random hexamers according to the manufacturer's protocol. Reactions were 
performed in 20 ul and incubated for 30 min. at 48^G. cDNA (5 ul) was then transferred to a 
separate plate for the TAQMAN® reaction using p-actin and GAPDH TAQMAN® Assay 
Reagents (PE Biosystems; Gatalog Nos. 43 1088 IE and 4310884E, respectively) and 
TAQMAN® universal PGR Master Mix (PE Biosystems; Gatalog No. 4304447) according to the 
manufacturer's protocol. Reactions were performed in 25 ul using the following parameters: 2 
min. at 50**G; 10 mm. at 95^G; 15 sec. at 95^G/1 min. at 60^G (40 cycles). Results were recorded 
as GT values (cycle at which a given sample crosses a threshold level of fluorescence) using a 
log scale, with the difference in RNA concentration between a given sample and the sample with 
the lowest GT value being represented as 2 to the power of delta GT. The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and multiplying by 
100. The average GT values obtained for 6-actin and GAPDH were used to normalize RNA 
samples. The RNA sample generating the highest GT value required no further diluting, while 
all other samples were diluted relative to this sample according to their p-actin /GAPDH average 
GT values. 

Normalized RNA (5 ul)'was converted to cDNA and analyzed via TAQMAN® using 
One Step RT-PGR Master Mix Reagents (PE Biosystems; Gatalog No. 4309169) and gene- 
specific primers according to the manufacturer's instructions. Probes and primers were designed 
for each assay according to Perkin Elmer Biosystem's Primer Express Software package 
(version I for Apple Gomputer's Macintosh Power PG) or a similar algorithm using the target 
sequence as input. Default settings were used for reaction conditions and the following 
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parameters were set b< 




selecting primers: primer concentra] 



250 nM, primer melting 



temperature (Tm) rang^^8°-60° C, primer optimal Tm = 59*^ C, maximum primer difference = 
2° C, probe does not have 5' G, probe Tm must be 10° C greater than primer Tm, amplicon size 75 
bp to 100 bp. The probes and primers selected (see below) were synthesized by Synthegen 
(Houston, TX, USA). Probes were double purified by HPLC to remove uncoupled dye and 
evaluated by mass spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' 
ends of the probe, respectively. Their final concentrations were: forward and reverse primers, 
900 nM each, and probe, 200nM. 

PGR conditions: Normalized RNA from each tissue and each cell line was spotted in 
each well of a 96 well PGR plate (Perkin Elmer Biosystems). PGR cocktails including two 
probes (a probe specific for the target clone and another gene-specific probe multiplexed with the 
target probe) were set up using IX TaqMan"^^ PGR Master Mix for the PE Biosystems 7700, 
with 5 mM MgG12, dNTPs (dA, G, G, U at 1 : 1 : 1 :2 ratios), 0.25 U/ml AmpliTaq Gold™ (PE 
Biosystems), and 0.4 U/|il RNase inhibitor, and 0.25 U/^il reverse transcriptase. Reverse 
transcription was performed at 48*^ G for 30 minutes followed by amplification/PGR cycles as 
follows: 95° G 10 min, then 40 cycles of 95° G for 15 seconds, 60° G for 1 minute. 

In the results for Panel 1, the following abbreviations are used: 

ca. = carcinoma, 

* = established from metastasis, 

met = metastasis, 

s cell var= small cell variant, 

non-s = non-sm =non-small, 

squam = squamous, 

pi. eff = pi effusion = pleural effusion, 

glio = glioma, 

astro = astrocytoma, and 

neuro = neuroblastoma. 



The plates for Panel 2 generally include 2 control wells and 94 test samples composed of 
RNA or cDNA isolated from human tissue procured by surgeons working in close cooperation 



Panel 2 
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with the National CaiiMj^stitute's Cooperative Human Tissu^^^pvork (CHTN) or the 
National Disease Resel^ff Initiative (NDRI). The tissues are denved from human malignancies 
and in cases where indicated many malignant tissues have "matched margins" obtained from 
noncancerous tissue just adjacent to the tumor. These are termed normal adjacent tissues and are 
5 denoted "NAT" in the results below. The tumor tissue and the "matched margins" are evaluated 
by two independent pathologists (the surgical pathologists and again by a pathologists at NDRI 
or CHTN). This analysis provides a gross histopathological assessment of tumor differentiation 
grade. Moreover, most samples include the original surgical pathology report that provides 
information regarding the clinical stage of the patient. These matched margins are taken from the 
10 tissue surrounding (i.e. immediately proximal) to the zone of surgery (designated "NAT", for 
normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples were obtained from 
various human tissues derived from autopsies performed on elderly people or sudden death 
victims (accidents, etc.). These tissue were ascertained to be free of disease and were purchased 
from various commercial sources such as Clontech (Palo Alto, CA), Research Genetics, and 
ifl 15 Invitrogen. 

O RNA integrity from all samples is controlled for quality by visual assessment of agarose 

^ gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio as a guide (2:1 

to 2.5 : 1 28s: 1 8s) and the absence of low molecular weight RNAs that would be indicative of 
tJ 20 degradation products. Samples are controlled against genomic DNA contamination by RTQ 
f=l PGR reactions run in the absence of reverse transcriptase using probe and primer sets designed to 

J;] amplify across the span of a single exon. 

Panel 4 

25 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) composed 
of RNA (Panel 4r) or cDNA (Panel 4d) isolated from various human cell lines or tissues related 
to inflammatory conditions. Total RNA from control normal tissues such as colon and lung 
(Stratagene ,La JoUa, CA) and thymus and kidney (Clontech) were employed. Total RNA from 
30 liver tissue from cirrhosis patients and kidney from lupus patients was obtained from BioChain 
(Biochain Institute, Inc., Hayward, CA). Intestinal tissue for RNA preparation from patients 
diagnosed as having Crohn's disease and ulcerative colitis was obtained from the National 
Disease Research Interchange (NDRI) (Philadelphia, PA). 
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Astrocytes, li 




►roblasts, dermal fibroblasts, coron; 




jery 



smooth muscle cells. 



small airway epitheliui^^ronchial epithelium, microvascular dermal endothelial cells, 
microvascular lung endothelial cells, himian pulmonary aortic endothelial cells, himian umbilical 
vein endothelial cells were all purchased from Clonetics (Walkersville, MD) and grown in the 
media supplied for these cell types by Clonetics. These primary cell types were activated with 
various cytokines or combinations of cytokines for 6 and/or 12-14 hours, as indicated. The 
following cytokines were used; IL-1 beta at approximately 1-5 ng/ml, TNF alpha at 
approximately 5-10 ng/ml, IFN ganuna at approximately 20-50 ng/ml, IL-4 at approximately 5- 
10 ng/ml, IL-9 at approximately 5-10 ng/ml, IL-13 at approximately 5-10 ng/ml. Endothelial 
cells were sometimes starved for various times by culture in the basal media from Clonetics with 
0.1% serum. 

Mononuclear cells were prepared from blood of employees at CuraGen Corporation, 
using Ficoll. LAK cells were prepared from these cells by culture in DMEM 5% PCS (Hy clone), 
100 |aM non essential amino acids (Gibco/Life Technologies, Rockville, MD), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10'^ M (Gibco), and 10 mM Hepes (Gibco) and 
Interleukin 2 for 4-6 days. Cells were then either activated with 10-20 ng/ml PMA and 1-2 
|Lig/ml ionomycin, IL-12 at 5-10 ng/ml, IFN gamma at 20-50 ng/ml and IL-1 8 at 5-10 ng/ml for 6 
hours. In some cases, mononuclear cells were cultured for 4-5 days in DMEM 5% FCS 
(Hy clone), 100 |aM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 
mercaptoethanol 5.5 x 10"^ M (Gibco), and 10 mM Hepes (Gibco) with PHA 
(phytohemagglutinin) or PWM (pokeweed mitogen) at approximately 5 |ug/ml. Samples were 
taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte reaction) samples 
were obtained by taking blood from two donors, isolating the mononuclear cells using Ficoll and 
mixing the isolated mononuclear cells 1:1 at a final concentration of approximately 2x10^ 
cells/ml in DMEM 5% FCS (Hyclone), 100 |aM non essential amino acids (Gibco), 1 mM 
sodium pyruvate (Gibco), mercaptoethanol (5.5 x 10*^ M) (Gibco), and 10 mM Hepes (Gibco). 
The MLR was cultured and samples taken at various time points ranging from 1- 7 days for RNA 
preparation. 

Monocytes were isolated from mononuclear cells using CD 14 Miltenyi Beads, +ve VS 
selection columns and a Vario Magnet according to the manufacturer's instructions. Monocytes 
were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum (FCS) 
(Hyclone, Logan, UT), 100 [iM non essential amino acids (Gibco), 1 mM sodium pyruvate 
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(Gibco), mercaptoethM|^.5 x 10'^ M (Gibco), and 10 mM He^|Gibco), 50 ng/ml GMCSF 
and 5 ng/ml IL-4 for 5^^ys. Macrophages were prepared by culture of monocytes for 5-7 days 
in DMEM 5% FCS (Hyclone), 100 |iM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10"^ M (Gibco), 10 mM Hepes (Gibco) and 10% AB 
Human Serum or MCSF at approximately 50 ng/ml. Monocytes, macrophages and dendritic cells 
were stimulated for 6 and 12-14 hours with lipopolysaccharide (LPS) at 100 ng/ml. Dendritic 
cells were also stimulated with anti-CD40 monoclonal antibody (Pharmingen) at 10 |ig/ml for 6 
and 12-14 hours. 

CD4 lymphocytes, CDS lymphocytes and NK cells were also isolated from mononuclear 
cells using CD4, CDS and CD56 Miltenyi beads, positive VS selection columns and a Vario 
Magnet according to the manufacturer's instructions. CD45RA and CD45RO CD4 lymphocytes 
were isolated by depleting mononuclear cells of CDS, CD56, CD14 and CD19 cells using CDS, 
CD56, CD 14 and CD 19 Miltenyi beads and +ve selection. Then CD45RO beads were used to 
isolate the CD45RO CD4 lymphocytes v^th the remaining cells being CD45RA CD4 
lymphocytes. CD45RA CD4, CD45RO CD4 and CDS lymphocytes were placed in DMEM 5% 
FCS (Hyclone), 100 |aM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 
mercaptoethanol 5.5 x 10"^ M (Gibco), and 10 mM Hepes (Gibco) and plated at 10^ cells/ml onto 
Falcon 6 well tissue culture plates that had been coated overnight with 0.5 |J.g/ml anti-CD2S 
(Pharmingen) and 3 ug/ml anti-CD3 (0KT3, ATCC) in PBS. After 6 and 24 hours, the cells 
were harvested for RNA preparation. To prepare chronically activated CDS lymphocytes, we 
activated the isolated CDS lymphocytes for 4 days on anti-CD2S and anti-CD3 coated plates and 
then harvested the cells and expanded them in DMEM 5% FCS (Hyclone), 100 \iM non essential 
amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10'^ M (Gibco), 
and 10 mM Hepes (Gibco) and IL-2. The expanded CDS cells were then activated again with 
plate bound anti-CD3 and anti-CD2S for 4 days and expanded as before. RNA was isolated 6 
and 24 hours after the second activation and after 4 days of the second expansion culture. The 
isolated NK cells were cultured in DMEM 5% FCS (Hyclone), 100 non essential amino 
acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10"^ M (Gibco), and 10 
mM Hepes (Gibco) and IL-2 for 4-6 days before RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with sterile 
dissecting scissors and then passed through a sieve. Tonsil cells were then spun down and 
resupended at 10^ cells/ml in DMEM 5% FCS (Hyclone), 100 jaM non essential amino acids 
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(Gibco), 1 mM sodii 




ivate (Gibco), mercaptoethanol 5.5 



^M(GibcoX and lOmM 



Hepes (Gibco). To activSte the cells, we used PWM at 5 |ig/ml or anti-CD40 (Pharmingen) at 
approximately 10 ^ig/ml and IL-4 at 5-10 ng/ml. Cells were harvested for RNA preparation at 
24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon plates were 
coated overnight with 10 |ig/ml anti-CD28 (Pharmingen) and 2 |ig/ml 0KT3 (ATCC), and then 
washed twice v^th PBS. Umbilical cord blood CD4 lymphocytes (Poietic Systems, German 



Town, MD) were cultured at 10 -10 cells/ml in DMEM 5% FCS (Hyclone), 100 |iM non 
essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x lO'^M 
(Gibco), 10 mM Hepes (Gibco) and IL-2 (4 ng/ml). IL-12 (5 ng/ml) and anti-IL4 (1 ^ig/ml) were 
used to direct to Thl, while IL-4 (5 ng/ml) and anti-IFN gamma (1 |ig/ml) were used to direct to 
Th2 and IL-10 at 5 ng/ml was used to direct to Trl. After 4-5 days, the activated Thl, Th2 and 
Trl lymphocytes were washed once in DMEM and expanded for 4-7 days in DMEM 5% FCS 
(Hyclone), 1 00 |iM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 
mercaptoethanol 5.5 x 10"^ M (Gibco), 10 mM Hepes (Gibco) and IL-2 (1 ng/ml). Following 
this, the activated Thl, Th2 and Trl lymphocytes were re-stimulated for 5 days with anti- 
CD28/OKT3 and cytokines as described above, but with the addition of anti-CD95L (1 |ig/ml) to 
prevent apoptosis. After 4-5 days, the Thl, Th2 and Trl lymphocytes were washed and then 
expanded again with IL-2 for 4-7 days. Activated Thl and Th2 lymphocytes were maintained in 
this way for a maximum of three cycles. RNA was prepared fi"om primary and secondary Thl , 
Th2 and Trl after 6 and 24 hours following the second and third activations with plate bound 
anti-CD3 and anti-CD28 mAbs and 4 days into the second and third expansion cultures in 
Interleukin 2. 

The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1, KU- 
812. EOL cells were ftirther differentiated by culture in 0.1 mM dbcAMP at 5 xlO^ cells/ml for 
8 days, changing the media every 3 days and adjusting the cell concentration to 5 xlO^ cells/ml. 
For the culture of these cells, we used DMEM or RPMI (as recommended by the ATCC), with 
the addition of 5% FCS (Hyclone), 100 \xM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10"^ M (Gibco), 10 mM Hepes (Gibco). RNA was 
either prepared from resting cells or cells activated with PMA at 10 ng/ml and ionomycin at 1 
|ig/ml for 6 and 14 hours. Keratinocyte line CCD 106 and an airway epithelial tumor line NCI- 



5 



6 



185 
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H292 were also obtaiijgM-om the ATCC. Both were cultured jJj^EM 5% PCS (Hyclone), 
100 i^M non essential a^io acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 
X 10"^ M (Gibco), and 10 mM Hepes (Gibco). CCD 1 106 cells were activated for 6 and 14 hours 
with approximately 5 ng/ml TNF alpha and 1 ng/ml IL-1 beta, while NCI-H292 cells were 
5 activated for 6 and 14 hours with the foUov^ng cytokines: 5 ng/ml IL-4, 5 ng/ml IL-9, 5 ng/ml 
IL-1 3 and 25 ng/ml IFN gamma. 

For these cell lines and blood cells, RNA was prepared by lysing approximately 
10^ cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane (Molecular 

10 Research Corporation) was added to the RNA sample, vortexed and after 10 minutes at room 
temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. The aqueous phase was 
removed and placed in a 1 5 ml Falcon Tube. An equal volume of isopropanol was added and left 
at -20 degrees C overnight. The precipitated RNA was spun dovm at 9,000 rpm for 15 min in a 
Sorvall SS34 rotor and washed in 70% ethanol. The pellet was redissolved in 300 |j,l of RNAse- 

15 free water and 35 |il buffer (Promega) 5 jiil DTT, 7 ^xl RNAsin and 8 |il DNAse were added. The 
tube was incubated at 37 degrees C for 30 minutes to remove contaminating genomic DNA, 
extracted once with phenol chloroform and re-precipitated v^th 1/10 volume of 3 M sodium 
acetate and 2 volumes of 100% ethanol. The RNA was spun down and placed in RNAse free 
water. RNA was stored at -80 degrees C. 

20 

The above detailed procedures were carried out to obtain the taqman profiles of the 
clones in question. 

Given below are the Primers and the Taqman results for the following clones: 
58092213.0.36 - Probe Name: Ag809 (Table 9 and Table 10) 
25 29692275.0.1 - Probe Name: Ag2773 (Table 1 1 and Table 12) 

32125243.0.21 - Probe Name: Ag427 (Table 13 and Table 14) 
27455183.0.19 -Probe Name: Agl541 (Table 15 and Table 16, 17, 18) 



Table 8: Primer Design for Probe Ag809 (FCTRl) 



Primer 


Sequences 


TM 


Length 


Start Po 


SEQID 
NO 


Forward 


5'-ATGTGATCTTTGGCTGTGAAGT-3* 


58.7 


22 


337 


24 


Probe 


FAM-5'-CTACCCCATGGCCTCCATCGAGT-3'-TAMRA 


69.4 


23 


365 


25 


Reverse 




59.9 


19 


393 


26 



186 
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5'-GGATGTCCAMCCATCCTT-3' 



TABLE 9: TAQMAN RESULTS FOR FCTRl 



Tissue Name 


r aiiei 

1 


Tissue_Name 


Don aI 

2D 


Tissue_Name 


4D 


Liver 

adenocarcinoma 


79.6 


Normal Colon 
061003 


6.8 


yu/Do occonaary i n i ami- 
CD28/anti-CD3 


2,0 


neari ^leiaij 


A*^ ft 
HO.O 


ooz ly wen 
to Mod Diff 


D. 1 


93769_Secondary Th2_anti- 

Pn9ft/anti PPi'^ 


1 .D 


nancreas 


0 1 
Z. 1 


83220 CC NAT 

/nnn'^ftfiR\ 




93770_Secondary Tr1_anti- 

Pr^Oft/anfi Prr^ 


9 *\ 


Pancreatic ca. 




83221 CC Gr.2 

rectosigmoid 
/ono'^ftfift\ 


u.y 


93573_Secondary Th1_resting 
oay *f-D in il-z 


1 n 

1 .u 


MO renal g'^nQ 




83222 CC NAT 

/or^O'^RRft\ 


i 0 
1 .Z 


93572_Secondary Th2_resting 
oay 't-D in il-z 




Thyroid 


6.5 


83235 CC Mod 
Diff (ODO3920) 


3.8 


93571_Secondary Tr1_resting 
day 4-6 in IL-2 


1.7 


Salivary gland 


12. o 


83236 CC NAT 
(UUUoS^O) 


1.0 


93568_primary Th1_anti- 
ouzc>/anti-uuo 


U.4 


Pituitary gland 


8.7 


83237 CC Gr.2 
ascend colon 
(OD03921) 


6.9 


yooby primary i nz_anti- 
CD28/anti-CD3 


1.5 


brain (Tetai; 


U.U 


83238 CC NAT 

(UUUoyzl ) 


4.U 


93570_primary Trl^anti- 

OUzo/antl-UUo 


z.u 


Brain (whole) 


3.0 


83241 CC from 
rartiai 

Hepatectomy 
(ODO4309) 


1.2 


93565 primary Th1 resting dy 4- 
6 in IL-2 


5.4 


Brain (amygdala) 


2.4 


QQOylO 1 \\/ar MAT 

<3oz4z Liver ina I 
(ODO4309) 


0.6 


yooDo primary i nz resting ay 4- 
6 in IL-2 


3.1 


Brain 

(cerebellum) 


0.0 


87472 Colon 
meis 10 lung 
(OD04451-01) 


4.4 


yooD/ primary in resting ay 4-0 
in IL-2 


0.0 


Brain 

(hippocampus) 


13.0 


ftTylT'i 1 linn MAT 

Of** to Lung IN A 1 
(OD04451-02) 


1.2 


youOl OU40r\A L/U4 

lymphocyte_anti-CD28/anti-CD3 


11.2 


Brain (thalamus) 


3.0 


Normal Prostate 

Vu'loniecn at 
6546-1 


10.2 


yOoDZ L^U40r\L^ L/U4 

lymphocyte_anti-CD28/anti-CD3 


1.2 


Cerebral Cortex 


2.3 


84140 Prostate 

L/ancer 

(OD04410) 


41.8 


yozo 1 \^uo Lympnocyics anii- 
CD28/anti-CD3 


0.9 


opinai coro 




84141 Prostate 

MAT {dPiOAA^CW 
IN A 1 \\JlJyjHH l\J) 


OR 7 


93353_chronic CD8 Lymphocytes 

On/ racfinn H\/ A^f^ in II _0 

zry rcoung uy h-o in il-z 


n n 

U.U 


CNSca. 
MG 


12.1 


87073 Prostate 
(OD04720-01) 


11.0 


Q'^S74 phrnnif f^Dft 1 v/mnhnrvtp^ 
JO»jf*T 01 II 11^ wL^u i_y 1 1 1^1 iv^i^y ICO 

2ry_activated CD3/CD28 


0.6 


CNSca. 
(glio/astro) U- 
118-MG 


100.0 


87074 Prostate 
NAT (OD04720- 
02) 


10.0 


93354 CD4 none 


1.1 


CNS ca. (astro) 
SW1783 


6.5 


Normal Lung 

GENPAK 

061010 


7.9 


93252 Secondary 
Thin'h2/Tr1 anti-CD95CH11 


0.0 


CNS ca.* (neuro; 
met ) SK-N-AS 


52.1 


83239 Lung Met 
to Muscle 


6.5 


93103_LAK ce!ls_resting 


0.5 
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.^0004286) 








CNS ca. (astro) 
SF-539 


12.6™ 


B3240 Muscle 
'NAT (OD04286) 


2.6 


• 

93788 LAK cells IL-2 


0.0 


CNS ca. (astro) 
SNB-75 


11.9 


84136 Lung 
Malignant 
Cancer 
(OD03126) 


14.8 


93787 LAK cells IL-2+IL-12 


0.7 


CNSca. 
(glio)SNB-19 


0.0 


84137 Lung NAT 
(OD03126) 


3.2 


93789_LAK cellsJL-2+IFN 
gamma 


1.1 


CNSca. 
(glio)U251 


0.9 


84871 Lung 

Cancer 

(OD04404) 


2.1 


93790 LAK cells IL-2+ IL-18 


0.3 


CNS ca. (glio) 
SF-295 


12.6 


84872 Lung NAT 
(OD04404) 


1.9 


93104 LAK cells PMA/ionomycin 
and IL-18 


0.0 


Heart 


13.9 


84875 Lung 

Cancer 

(OD04565) 


0.3 


93578_NK Cells IL-2_resting 


1.3 


Skeletal muscle 


3.2 


85950 Lung 

Cancer 

(OD04237-01) 


1.3 


93109_Mixed Lymphocyte 
Reaction_Two Way MLR 


0.5 


Bone marrow 


3.6 


85970 Lung NAT 
(OD04237-02) 


2.6 


93110_Mixed Lymphocyte 
Reaction_Two Way MLR 


0.5 


Thymus 


4.2 


83255 Ocular 
Mel Met to Liver 
(ODO4310) 


0.1 


93111_Mixed Lymphocyte 
Reaction_Two Way MLR 


2.7 


Spleen 


61.6 


83256 Liver NAT 
(ODO4310) 


0.6 


931 12_Mononuclear Cells 
(PBMCsLresting 


0.0 


Lymph node 


3.3 


84139 

Melanoma Mets 
to Lung 
(OD04321) 


2.5 


931 13_Mononuclear Cells 
(PBMCs)_PWM 


1.3 


Colorectal 


11.9 


84138 Lung 
NAT(OD04321) 


2.6 


93114 Mononuclear Cells 
(PBMCs)_PHA-L 


1.0 


Stomach 


28.3 


Normal Kidney 

GENPAK 

061008 


5.6 


93249_Ramos (B cell)_none 


1.2 


Small intestine 


4.5 


83786 Kidney 
Ca, Nuclear 
grade 2 
(OD04338) 


0.6 


93250_Ramos (B cell)Jonomycin 


2.3 


Colon ca. SW480 


46.7 


83787 Kidney 
NAT (OD04338) 


3.7 


93349_B lymphocytes_PWM 


4.3 


Colon ca.* 

(SW480 

met)SW620 


19.0 


83788 Kidney Ca 
Nuclear grade 
1/2 (OD04339) 


0.8 


93350 B lymphoytes CD40L and 
IL-4 


1.4 


Colon ca. HT29 


5.3 


83789 Kidney 
NAT (OD04339) 


3.1 


92665_EOL-1 

(Eosinophil)_dbcAMP 

differentiated 


7.2 


Colon ca. HCT- 
116 


5.0 


83790 Kidney 
Ca, Clear cell 
type (OD04340) 


1.5 


93248_EOL-1 

{Eosinophil)_dbcAMP/PMAionom 
ycin 


3.0 


Colon ca. CaCo-2 


49.3 


83791 Kidney 
NAT (OD04340) 


5.1 


93356 Dendritic Cells none 


1.5 


83219 CC Well to 
Mod Diff 
(OD03866) 


3.0 


83792 Kidney 
Ca, Nuclear 
grade 3 
(OD04348) 


14.5 


93355_Dendritic Cells_LPS 100 
ng/ml 


0.7 


Colon ca. HCC- 
2998 


27.7 


83793 Kidney 
NAT (OD04348) 


2.5 


93775 Dendritic Cells anti-CD40 


0.5 


Gastric ca.* (liver 
met) NCI-N87 


10.5 


87474 Kidney 
Cancer 


1.7 


93774_Monocytes_resting 


0.5 
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^(OD04622-01) 




0 




Bladder 


3.7 


P7475 Kidney 
^NAT (OD04622- 
03) 


2.0 


93776_Monocytes_LPS 50 ng/ml 


0.0 


Trachea 


23.5 


85973 Kidney 

Cancer 

(OD04450-01) 


0.3 


93581_Macrophages_resting 


1.3 


Kidney 


1.8 


85974 Kidney 
NAT 

(OD04450-03) 


2.0 


93582_Macrophages_LPS 100 
ng/ml 


1.8 


Kidney (fetal) 


1.9 


Kidney Cancer 

Clontech 

8120607 


7.0 


93098_HUVEC 
(EndotheliaI)_none 


2.3 


Renal ca. 786-0 


7.0 


Kidney NAT 

Clontech 

8120608 


1.5 


93099_HUVEC 
{Endothelial)_starved 


9.0 


Renal ca. A498 


6.8 


Kidney Cancer 

Clontech 

8120613 


2.0 


93100 HUVEC (Endothelial) IL- 
1b 


1.2 


Renal ca.RXF 
393 


4.7 


Kidney NAT 

Clontech 

8120614 


4.1 


93779„HUVEC (Endothelial)JFN 
gamma 


1.4 


Renal ca.ACHN 


9.8 


Kidney Cancer 

Clontech 

9010320 


2.2 


93102 HUVEC 
(Endothelial)„TNF alpha + IFN 
gamma 


0.8 


Renal ca.UO-31 


1.3 


Kidney NAT 

Clontech 

9010321 


3.5 


93101 HUVEC 
(Endothelial)_TNF alpha + IL4 


1.1 


Renal ca.TK-10 


0.6 


Normal Uterus 

GENPAK 

061018 


3.1 


93781 HUVEC (Endothelial) IL- 
11 


3.0 


Liver 


0.8 


Uterus Cancer 

GENPAK 

064011 


17.6 


93583_Lung Microvascular 
Endothelial Cells none 


0.8 


Liver (fetal) 


1.1 


Normal Thyroid 
Clontech A+ 
6570-1 


3.7 


93584_Lung Microvascular 
Endothelial Cells TNFa (4 ng/ml) 
and IL1b(1 ng/ml) 


0.5 


Liver ca. 

(hepatoblast) 

HepG2 


54.0 


Thyroid Cancer 

GENPAK 

064010 


1.2 


92662_Microvascular Dermal 
endothelium none 


1.1 


Lung 


3.9 


Thyroid Cancer 

INVITROGEN 

A302152 


0.6 


92663_Microsvasular Dermal 
endothelium TNFa (4 ng/ml) and 
IL1b(1 ng/ml) 


1.0 


Lung (fetal) 


9.0 


Thyroid NAT 

INVITROGEN 

A302153 


2.6 


93773_Bronchial 

epithelium TNFa (4 ng/ml) and 

IL1b(1 ng/ml)** 


0.0 


Lung ca. (small 
cell) LX-1 


34.4 


Normal Breast 

GENPAK 

061019 


3.4 


93347_Small Ainway 
Epithelium_none 


0.4 


Lung ca. (small 
cell) NCI-H69 


3.0 


84877 Breast 

Cancer 

(OD04566) 


0.9 


93348_Small Airway 
Epithelium TNFa (4 ng/ml) and 
IL1b(1 ng/ml) 


0.5 


Lung ca. (s.cell 
var.) SHP-77 


13.0 


85975 Breast 

Cancer 

(OD04590-01) 


67.8 


92668_Coronery Artery 
SMC_resting 


5.8 


Lung ca. (large 
cell)NCI-H460 


6.8 


85976 Breast 
Cancer Mets 
(OD04590-03) 


51.1 


92669 Coronery Artery 
SMC_TNFa (4 ng/ml) and ILIb (1 
ng/ml) 


2.3 


Lung ca. (non- 
sm. cell) A549 


3.4 


87070 Breast 

Cancer 

Metastasis 


12.7 


93 1 07_astrocytes_resting 


2.7 
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,^0004655-05) 










Lung ca. (non- 
s.cell) NCI-H23 


34.4^ 


■BENPAK Breast 
Cancer 064006 


8.9 


93108_astra^p6s_TNFa (4 

ng/ml) and IL1b (1 ng/ml) 


0.0 








Breast Cancer 










Lung ca (non- 
s.cell) HOP-62 


10.5 


Clontech 
9100266 


6.2 


92666_KU-812 (Basophil)_resting 


6.8 








Breast NAT 










Lung ca. (non- 
s.cl) NCI-H522 


47,6 


Clontech 
9100265 


3.3 


92667_KU-812 
{Basophil)_PMA/ionoycin 


8.4 




Lung ca. 
(squann.) SW 
900 


4.7 


Breast Cancer 
INVITROGEN 
A209073 


3.4 


93579_CCD1106 
{Keratinocytes)_none 


1.6 




Lung ca. 
(squam.) NCI- 
H596 


0.7 


Breast NAT 

INVITROGEN 

A2090734 


8.7 


93580_CCD110d 
(Keratmocytes)_TNFa and IFNg 


A A 

1.4 








Normal Liver 














GENPAK 










Mammary gland 


9.9 


061009 


1.1 


93791 Liver Cirrhosis 


4.2 








Liver Cancer 










Breast ca.* (pi. 




GENPAK 










effusion) MCF-7 


5.6 


064003 


0.6 


93792_Lupus Kidney 


A f\ 

1.9 








Liver Cancer 














Research 










Breast ca.* (pi.ef) 
MDA-MB-231 


21.3 


Genetics RNA 
1025 


0.6 


93577 NCI-H292 


39.5 


fri 






Liver Cancer 
Research 








r\ 


Breast ca.*(pl. 




Genetics RNA 








ri 


effusion) T47D 


66.0 


1026 


1.4 


93358 NCI-H292 IL-4 


39.0 


rsr 






Paired Liver 








,r\ 






Cancer Tissue 














Research 










Breast ca. BT- 




Genetics RNA 










549 


7.6 


6004-T 


1.3 


^ A A A ft i III ^\^\^\ 1 1 ^\ 

93360 NCI-H292 IL-9 


65.5 








Paired Liver 














Tissue Research 








f=^ 






Genetics RNA 








; 55 


Breast ca.MDA-N 


18.7 


6004-N 


1.3 


93359 NCI-H292 IL-13 


37.1 


rl 

•ess- 






Paired Liver 














Cancer Tissue 
Research 
Genetics RNA 










Ovary 


12.1 


6005-T 


1.1 


93357_NCI-H292_IFN gamma 


31.9 








Paired Liver 














Tissue Research 










Ovarian 




Genetics RNA 










ca.OVCAR-3 


3.5 


6005-N 


0.3 


93777 HPAEC - 


0.5 








Normal Bladder 










Ovarian 




GENPAK 




93778_HPAEC_IL-1 beta/TNA 






ca.OVCAR-4 


4.0 


061001 


5.9 


alpha 


1.2 








Bladder Cancer 














Research 










Ovarian ca. 




Genetics RNA 




93254_Normal Human Lung 






OVCAR-5 


9.1 


1023 


1.7 


Fibroblast none 


42.3 








Bladder Cancer 




93253_Normal Human Lung 






Ovarian ca. 




INVITROGEN 




Fibroblast_TNFa (4 ng/ml) and IL- 






OVCAR-8 


12.7 


A302173 


1.9 


lb (1 ng/ml) 


A^ O 

17.8 








87071 Bladder 










Ovarian 




Cancer 




)yo£,0( iNormai numdn Luny 






ca.lGROV-1 


9.8 


(OD04718-01) 


2.0 


Fibroblast IL-4 


100.0 




Ovarian ca.* 


0.4 


87072 Bladder 


3.3 


93256_Normal Human Lung 


72.7 



15966-697 



(ascites) SK-OV- 
3 


^Uvlormal Adjacent 
^■[000471 8-03) 




Fibroblast_^^ 




Uterus 


6.9 


Normal Ovary 
Res. Gen. 


2.2 


93255 Normal Human Lung 
Fibroblast IL-13 


60.7 


Plancenta 


4.6 


Ovarian Cancer 

GENPAK 

064008 


29.1 


93258_Normal Human Lung 
FibroblastJFN gamma 


81.8 


Prostate 


15.7 


87492 Ovary 

Cancer 

(OD04768-07) 


100.0 


93106_Dermal Fibroblasts 
CCD1070_resting 


76.8 


Prostate ca.* 
(bone met)PC-3 


35.9 


87493 Ovary 
NAT {OD04768- 
08) 


2.2 


93361 Dermal Fibroblasts 
CCD1070_TNF alpha 4 ng/ml 


30.2 


Testis 


14.6 


Normal Stomach 

GENPAK 

061017 


13.1 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


38.2 


Melanoma 
Hs688(A).T 


13.5 


NAT Stomach 

Clontech 

9060359 


8.8 


93772_clermal fibroblastJFN 
gamma 


34.2 


Melanoma* (met) 
Hs688(B).T 


71.2 


Gastric Cancer 

Clontech 

9060395 


2.5 


93771 dermal fibroblast IL-4 


80.7 


Melanoma 
UACC-62 


1.7 


NAT Stomach 

Clontech 

9060394 


9.7 


93259_IBD Colitis 1** 


0.0 


Melanoma M14 


9.5 


Gastric Cancer 

Clontech 

9060397 


15.9 


93260_IBD Colitis 2 


0.3 


Melanoma LOX 
IMVI 


2.4 


NAT Stomach 
Clontech 

9060396 


12.9 


93261_IBD Crohns 


1.4 


Melanoma* 
fmGt^SK-MEL-5 


3.4 


OaSiriC OaHUcr 

GENPAK 
064005 


12.1 


Colon normal 


35.6 


Adioose 


5.9 






735019 Lung none 


11.0 










64028-1_Thymus_none 


5.8 










64030-1_Kidney_none 


9.7 



Taqman results shown in Table 9 demonstrates that cFCTRl is highly expressed by 
tumor cell lines and also overexpressed in tumor tissues, specifically breast and ovarian tumor I 
compared to Normal Adjacent Tissues (NAT). There are reports that foUistatin can act as a 
5 modulator of tumor growth and its expression also correlate with polycystic ovary syndrome, a 
benign form of ovarian tumor. 



Table 10: Primer Design for Probe Ag2773 (FCTR4) 



Primer 


Sequences 


TM 


Length 


Start Po 


SEQ ID 
NO 


Forward 


5'-CCTTGCTTTGTCATATGCTGTT-3' 


59.3 


22 


243 


29 


Probe 


FAM-5'-CCCTTTGCCTGGAATATAAACTCTCA-3'-TAMRA 


64.6 


26 


265 


30 


Reverse 


5'-AGAGGAAGCTTTCTGGAGAAGA-3' 


58.9 


22 


313 


31 



191 
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TAfiU; 11: TAQMAN RESULTS FOR CHNE FCTR4 



1 issuG_iManie 


Panel 

1 LI 


1 i5sue_i>iarTie 


Panel 
on 


1 issue iMame 


Panel 


Liver 

3aenoc3rcinom3 




Normal Colon 
otlNrAix U0IUU0 


A'i 0 


93768_Secondary Th1_anti- 
uUiCo/anu-uuo 


10 7 


rieart (leiai) 




83219 CC Well to 
Mod Diff 

(UUUOODD) 


O.Z 


93769_Secondary Th2_anti- 
L/U^o/anii-LrUo 


1/1 0 


Pancreas 


O.I 


83220 CC NAT 

(UUUOOOD) 




93770_Secondary Tr1_anti- 
uuzo/anii-uuo 


1 >1 "7 
14. / 


Pancreatic 
ca.CAPAN 2 


20.0 


83221 CC Gr.2 

rectosigmoid 

(OD03868) 


0.7 


93573 Secondary Thi resting day 4- 
6 in IL-2 


4.7 


Adrenal gland 


7-4 


QOOOO KIAT 

L,L> NA 1 
(OD03868) 


1.4 


93572 Secondary Th2 resting day 4- 
6 in IL-2 


3-5 


Fnyroid 


D.o 


83235 CC Mod 

UITT (OUUoS^^O) 


A A n 
14.0 


93571_Secondary Tr1_resting day 4- 
D in iL-z 




Salivary gland 




83236 CC NAT 
(UDUo920) 


Hon 
13.9 


93568_primary Th1_anti-CD28/anti- 
CDo 


2Z.4 


Pituitary gland 


5,7 


83237 CC Gr.2 
ascend colon 
(OD03921) 


16.2 


93569 jDrimaryTh2 anti-CD28/anti- 
CD3 


16.3 


Brain (fetal) 


14.4 


83238 CC NAT 
(OD03921) 


5.2 


93570 primary Tr1 anti-CD28/anti- 
CD3 


21.8 


Brain (whole) 


19.6 


83241 CC from 
Partial 

Hepatectomy 
(ODU4309) 


13.9 


93565_primary Th1_resting dy 4-6 in 
IL-2 


30.2 


Brain 

(amygdala) 


3.7 


83242 Liver NAT 

(ODO4309) 


12.7 


93566_primary Th2_resting dy 4-6 in 
IL-2 


AAA 

14.4 


Brain 

(cerebellum) 


2.1 


87472 Colon 
mets to lung 
(UD04451-01) 


0 A 

3.4 


93567_primary Tr1_resting dy 4-6 in 
IL-2 


7.4 


Brain 

(hippocampus) 


22.7 


87473 Lung NAT 
(OD04451-02) 


1.5 


93351_CD45RA CD4 
lymphocyte_anti-CD28/anti-CD3 


7.6 


Brain (thalamus) 


7.4 


Normal Prostate 
Clontech A+ 
d54d-1 


*! n 
1.0 


93352_CD45RO CD4 

iympnocyte_anti-oiJzo/anti-L/Uo 


111 


Uerebral Cortex 


47.3 


84140 Prostate 
Cancer 

tr\T\nA A ^ n\ 
(UD04410) 


0.1 


93251_CD8 Lymphocytes_anti- 

OU2o/anu-UDo 


y.b 


Spinal cord 


8.3 


84141 Prostate 
NAT (OD04410) 


10.6 


93353_chronic CD8 Lymphocytes 
2ry_resting dy 4-6 in IL-2 


9.7 


CNS ca. 
(glio/astro)U87- 

IVIo 


■1 Q Q 


07rt70 n ^^^t^i^ 

87073 Prostate 
Cancer 

ir\r\nA7or\ n'i\ 


y. / 


93574_chronic CDS Lymphocytes 
zry_aciivaiea ouo/uuzo 


o.z 


CNSca. 
(glio/astro) U- 

1 lO-IVIo 


O/^ .u 


87074 Prostate 
NAT (OD04720- 


0.0 


OU4 none 


a A 

0,H 


CNS ca. (astro) 

CXA/i TOO 


'inn 


Normal Lung 


OD.b 


93252__Secondary Th1/Th2/Tr1_anti- 


y.o 


CNS ca.* 
(neuro; met )SK- 

IN -Mo 


AA ft 


83239 Lung Met 
to Muscle 


\ \ .f 


QQ1rt*3 1 A i^^llo roof inn 

lUo LMr\ ceiis_resiing 


1 1 n 


CNS ca. (astro) 
SF-539 


37.4 


83240 Muscle 
NAT (OD04286) 


3.4 


93788 LAK cells IL-2 


10.4 


CNS ca. (astro) 
SNB-75 


62.0 


84136 Lung 
Malignant Cancer 
{OD03126) 


15.1 


93787 LAK cells IL-2+IL-12 


7.4 



15966-697 



CNS ca. (glio) 
SNB-19 


24.8 


J^7 Lung NAT 
nB3126) 


17.4 


93789_LAK JB^IL-2+IFN gamma 


11.6 


CNS ca. (glio) 
U251 


40.3 


8TO71 Lung 

Cancer 

(OD04404) 


5.0 


93790 LAK cells IL-2+ IL-18 


13.3 


CNS ca. (glio) 
SF-295 


100.0 


84872 Lung NAT 
(OD04404) 


6.3 


93104 LAK cells PMA/ionomycin 
and IL-18 


4.8 


Heart 


0.0 


84875 Lung 

Cancer 

(OD04565) 


3.2 


93578_NK Cells IL-2_resting 


6.2 


Skeletal muscle 


0.0 


85950 Lung 

Cancer 

(OD04237-01) 


15.8 


93109_Mixed Lymphocyte 
Reaction_Two Way MLR 


12.3 


Bone marrow 


33.7 


85970 Lung NAT 
(OD04237-02) 


10.5 


93110_Mixed Lymphocyte 
Reaction_Two Way MLR 


8.7 


Thymus 


12.4 


83255 Ocular 
Mel Met to Liver 
(ODO4310) 


5.9 


931 1 1_Mixed Lymphocyte 
Reaction_Two Way MLR 


3.5 


Spleen 


21.3 


83256 Liver NAT 
(ODO4310) 


3.6 


93112_Mononuclear Cells 
(PBMCs)_resting 


4.5 


Lymph node 


13.4 


84139 Melanoma 
Mets to Lung 
(OD04321) 


10.6 


93113_Mononuclear Cells 
(PBMCs)_PWM 


21.2 


Colorectal 


38.2 


84138 Lung NAT 
(OD04321) 


10.6 


93114_Mononuclear Cells 
(PBMCs)_PHA-L 


8.9 


Stomach 


9.9 


Normal Kidney 
GENPAK 061008 


26.2 


93249_Ramos (B cell)_none 


100.0 


Small intestine 


17.9 


83786 Kidney 
Ca, Nuclear 
grade 2 
(OD04338) 


22.2 


93250_Ramos (B cell)_ionomycm 


28.7 


Colon 
ca.SW480 


27.7 


83787 Kidney 
NAT (OD04338) 


11.7 


93349_B lymphocytes_PWM 


20.0 


Colon ca.* 

(SW480 

met)SW620 


30.8 


83788 Kidney Ca 
Nuclear grade 
1/2 (OD04339) 


45.1 


93350_B lymphoytes_CD40L and IL- 
4 


7.8 


Colon ca.HT29 


8.1 


83789 Kidney 
NAT (OD04339) 


14.8 


92665_EOL-1 (Eosinophil)_dbcAMP 
differentiated 


8.0 


Colon ca.HCT- 
116 


35.4 


83790 Kidney 
Ca, Clear cell 
type (OD04340) 


26.6 


93248_EOL-1 

(Eosinophil)_dbcAMP/PMAionomycin 


3.8 


Colon ca. CaCo- 
2 


37.6 


83791 Kidney 
NAT (OD04340) 


10.4 


93356 Dendritic Cells none 


6.8 


83219 CC Well 
to Mod Diff 
(OD03866) 


17.8 


83792 Kidney 
Ca, Nuclear 
grade 3 
(OD0.4348) 


2.4 


93355_Dendntic Cells_LPS 100 ng/ml 


3.3 


Colon ca.HCC- 
2998 


19.9 


83793 Kidney 
NAT (OD04348) 


18.8 


93775 Dendritic Cells anti-CD40 


6.3 


Gastric ca.* 
(liver met) NCI- 
N87 


73.2 


87474 Kidney 

Cancer 

(OD04622-01) 


5.6 


93774_Monocytes_resting 


10.6 


Bladder 


43.2 


87475 Kidney 
NAT (OD04622- 
03) 


0.5 


93776_Monocytes_LPS 50 ng/ml 


3.5 


Trachea 


10.3 


85973 Kidney 

Cancer 

(OD04450-01) 


21.2 


93581_Macrophages_resting 


7.6 


Kidney 


9.2 


85974 Kidney 
NAT (OD04450- 
03) 


9.3 


93582_Macrophages_LPS 100 ng/ml 


3.9 
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Kidney (fetal) 


0.0 


^H^ey Cancer 
^Bitech 

OT20607 


0.0 


m 

93098„HUVEC {Endothelial)_none 


8.5 


Renal ca.786-0 


53.6 


Kidney NAT 

Clontech 

8120608 


0.9 


93099_HUVEC (Endothelial)_starved 


17.9 


Renal ca. A498 


36.1 


Kidney Cancer 

Clontech 

8120613 


0.0 


93100_HUVEC (Endothelial)JL-1b 


6.0 


Renal ca.RXF 
393 


31.6 


Kidney NAT 

Clontech 

8120614 


0.9 


93779_HUVEC (Endothelial)JFN 
gamma 


7.8 


Renal ca.ACHN 


21.6 


Kidney Cancer 

Clontech 

9010320 


2.7 


93102_HUVEC (EndothelialLTNF 
alpha + IFN gamma 


5.7 


Renal ca.UO-31 


28.7 


Kidney NAT 

Clontech 

9010321 


5.0 


93101_HUVEC {Endothelial)_TNF 
alpha + IL4 


5.6 


Renal ca.TK-10 


7.0 


Normal Uterus 
GENPAK 061018 


5.3 


93781_HUVEC (Endothelial)JL-11 


4.9 


Liver 


14.2 


Uterus Cancer 
GENPAK 06401 1 


9.0 


93583_Lung Microvascular 
Endothelial Cells_none 


4.9 


Liver (fetal) 


14.5 


Normal Thyroid 
Clontech A+ 
6570-1 


3.4 


93584_Lung Microvascular 
Endothelial Cells TNFa (4 ng/ml) and 
IL1b(1 ng/ml) 


4.9 


Liver ca. 

(hepatoblast) 

HepG2 


59.9 


Thyroid Cancer 
GENPAK 064010 


1.8 


92662_Microvascular Dermal 
endothelium none 


8.6 


Lung 


17.8 


Thyroid Cancer 

INVITROGEN 

A302152 


3.6 


92663_Microsvasular Dermal 
endothelium TNFa (4 ng/ml) and IL1b 
(1 ng/ml) 


6.0 


Lung (fetal) 


9.6 


Thyroid NAT 

INVITROGEN 

A302153 


4.9 


93773 Bronchial epithelium_TNFa (4 
ng/ml) and IL1b{1 ng/ml) ** 


0.9 


Lung ca. (snnall 
cell) LX-1 


70.2 


Normal Breast 
GENPAK 061019 


8.5 


93347_Small Ainway Epithelium_none 


1.3 


Lung ca. (small 
cell) NCI-H69 


29.9 


84877 Breast 

Cancer 

(OD04566) 


1.5 


93348_Small Ainway 

Epithelium TNFa (4 ng/ml) and IL1b 

(1 ng/ml) 


13.2 


Lung ca. (s.cell 
var.) SHP-77 


3.9 


85975 Breast 

Cancer 

(OD04590-01) 


23.8 


92668_Coronery Artery SMC_resting 


3.4 


Lung ca. (large 
cell)NCI-H460 


2.0 


85976 Breast 
Cancer Mets 
(OD04590-03) 


24.5 


92669 Coronery Artery SMC TNFa 
(4 ng/ml) and IL1b(1 ng/ml) 


2.0 


Lung ca. (non- 
sm. cell) A549 


28.5 


87070 Breast 
Cancer 
Metastasis 
(OD04655-05) 


12.9 


931 07_astrocytes_resting 


4.7 


Lung ca. (non- 
s.cell) NCI-H23 


36.1 


GENPAK Breast 
Cancer 064006 


11.8 


93108 astrocytes TNFa (4 ng/ml) 
andlL1b(1 ng/ml) 


1.9 


Lung ca (non- 
s.cell) HOP-62 


29.9 


Breast Cancer 

Clontech 

9100266 


3.2 


92666_KU-812 {Basophil)„resting 


5.8 


Lung ca. (non- 
s.cl) NCI-H522 


17.2 


Breast NAT 

Clontech 

9100265 


1.8 


92667_KU-812 
(Basophil)_PMA/ionoycin 


12.0 


Lung ca. 
(squam.) SW 
900 


63.7 


Breast Cancer 
INVITROGEN 
A209073 


11.0 


93579_CCD1106 
(Keratinocytes) none 


4.9 


Lung ca. 


10.0 


Breast NAT 


7.1 


93580 CCD1106 


0.3 
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(squam.) NCI- 
H59d 




^^■90734 




(Keratinocyte^^BJFa and IFNg 








Normal Liver 








Mammary gland 


4.6 


GENPAK 061009 


8.8 


93791 Liver Cirrhosis 


1.8 


Breast ca.* (pi. 












effusion) MCF- 
7 


0,0 


Liver Cancer 
GENPAK 064003 


4,9 


93792_Lupus Kidney 


1,6 






Liver Cancer 








Breast ca.* 




Research 








(pi.ef) MDA-MB- 




Genetics RNA 








231 


38.7 


1025 


1,0 


93577_NCI-H292 


11.1 






Liver Cancer 












Research 








Breast ca.* (pi. 




Genetics RNA 








effusion) T47D 


0.0 


1026 


0.8 


93358 NCI-H292 IL-4 


12.2 






Paired Liver 












Cancer Tissue 












Research 








Breast ca.BT- 




Genetics RNA 








549 


4.6 


6004-T 


3.0 


93360 NCI-rl292 IL-9 


7.6 






Paired Liver 












Tissue Research 








Breast ca.MDA- 




Genetics RNA 








N 


19.0 


6004-N 


7.3 


93359 NCI-H292 IL-13 


6.1 






Paired Liver 












Cancer Tissue 












Research 












Genetics RNA 








Ovary 


1.7 


6005-T 


0.2 


93357_NCI-H292JFN gamma 


5.8 






Paired Liver 












Tissue Research 








Ovarian 




Genetics RNA 








ca.OVCAR-3 


4.8 


6005-N 


0.0 


r\ O ~7~7~7 11 n A C~ 


C Q 

D.O 


Ovarian 




Normal Bladder 








ca.OVCAR-4 


0.0 


GENPAK 061001 


19.8 


93778_HPAEC_IL-1 beta/TNA alpha 


5.4 






Bladder Cancer 












Research 








Ovarian 




Genetics RNA 




^ ^ ^\ ^^Al lit 1 

93254_Normal Human Lung 




ca.OVCAR-5 


39.0 


1023 


3.1 


Fibroblast none 


2.1 






Bladder Cancer 




93253_Normal Human Lung 




Ovarian 




INVITROGEN 




1 II L 1 l~ t A 1 l\ -111 Jk 

Fibroblast_TNFa (4 ng/ml) and IL-lb 




ca.OVCAR-8 


36.6 


A302173 


9.9 


(1 ng/ml) 


1.9 






87071 Bladder 








Ovarian 




Cancer 




^ ^ ^ ^ ^9 A 1 lit 1 

93257 Normal Human Lung 




ca.lGROV-1 


0.0 


{OD04718-01) 


6.6 


FibroblastJL-4 


3.6 


Ovarian ca.* 




87072 Bladder 








(ascites) SK- 
OV-3 


65.5 


Normal Adjacent 
(OD0471O-03) 


4.0 


93256_Normal Human Lung 
Fibroblast iL-9 


o o 
o.o 


Uterus 


1.6 


Normal Ovary 
Res. Gen. 


0.3 


93255_Normal Human Lung 
Fibroblast IL-13 








Ovarian Cancer 




93258_Normal Human Lung 




Plancenta 


8.9 


GENPAK 064008 


6,8 


Fibroblast J FN gamma 


2.9 






87492 Ovary 
Cancer 




93106_Dermal Fibroblasts 




Prostate 


0.0 


(OD04768-07) 


100.0 


CCD1070_resting 


5.6 


Prostate ca.* 




87493 Ovary 
NAT (OD04768- 




93361_Dermal Fibroblasts 




(bone met)PC-3 


9.2 


08) 


3.6 


CCD1070_TNF alpha 4 ng/ml 


17.4 






Normal Stomach 




yoiuo uermai rioroDiasis 




Testis 


29.5 


GENPAK 061017 


8.6 


CCD1070JL-1 beta 1 ng/ml 


3.8 


Melanoma 


14.3 


NAT Stomach 


0.7 


93772_dermal fibroblastJFN gamma 


2.6 
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Hs688(A).T 




J||ntech 
■P)359 




• 




Melanoma* 
(met) 

Hs688(B).T 


22.9 


^stric Cancer 
Clontech 
9060395 


3.9 


93771 dermal fibroblast IL-4 


3.4 


Melanoma 
UACC-62 


9.7 


NAT Stomach 

Clontech 

9060394 


5.3 


93259 IBD Colitis 1** 


0.2 


Melanoma M14 


12.7 


Gastric Cancer 

Clontech 

9060397 


13.2 


93260 IBD Colitis 2 


0.4 


Melanoma LOX 
IMVI 


4.5 


NAT Stomach 

Clontech 

9060396 


1.1 


93261_IBD Crohns 


0.3 


Melanoma* 
fmet) SK-MEL-5 


21.8 


Gastric Cancer 
GENPAK 064005 


23.0 


735010 Colon normal 


3.3 


Adipose 


6.7 






73501 9_Lung_none 


3.9 










64028-1 _Thymus_none 


7.7 










64030-1_Kidney_none 


21.8 



Table 12 shows the taqman results of clone FCTR4 indicating overexpression in ovarian cancer as 
compared to Normal Adjacent Tissue (NAT). In addition, increased expression is demonstrated by ovarian 
tumor cell line suggesting that antibodies could be used to treat ovarian tumors. 
5 

Table 13: Primer Design for Probe Ag427 (FCTR5) 



Primer 


Sequences 


Length 


Start Po 


SEQID 
NO 


Forward 


5'-GAGCTACAGGCAGCCTCGAGT-3' 


21 


443 


32 


Probe 


TET-5'-TGGCCCAGCTGACCCTGCTCA-3'-TAMRA 


21 




33 


Reverse 


5'-GGCTACGTCAGTGGGTTTGG-3' 


20 


449 


34 



Table 14: Taqman results for FCTR5 

10 



Tissue_Name 


Panel 1 


Tissue_Name 


Panel 4D 


Endothelial cells 


10.7 


93768_Secondary Th1_anti-CD28/anti-CD3 


15.9 


Endothelial cells (treated) 


15.2 


93769_Secondary Th2„anti-CD28/anti-CD3 


14.7 


Pancreas 


16.2 


93770_Secondary Tr1_anti-CD28/anti-CD3 


21.9 


Pancreatic ca.CAPAN 2 


10,5 


93573 Secondary Thi resting day 4-6 in 
IL-2 


12.3 


Adipose 


45.1 


93572 Secondary Th2 resting day 4-6 in 
IL-2 


16.2 


Adrenal gland 


61.6 


93571 Secondary Tr1 resting day 4-6 in IL- 
2 


16.2 


Thyroid 


13.1 


93568_primary Th1_anti-CD28/anti-CD3 


13.9 



196 
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Salavary gland 


33.7 


93569jDrimary Th2_anti^fc8/anti-CD3 


14.6 




Pituitary gland 


15.8 


93570_pnmary Tr1_anti-lBE8/anti-CD3 


26.2 




Brain (fetal) 


7.2 


93565jDnmary Th1_resting dy 4-6 in IL-2 


56.3 




Brain (whole) 


6.3 


93566_primary Th2_resting dy 4-6 in IL-2 


27.7 




Brain (amygdala) 


8.4 


93567_primary Tr1_resting dy 4-6 in IL-2 


31.6 








93351_CD45RA CD4 lymphocyte_anti- 






Brain (cerebellunn) 


6.8 


CD28/anti-CD3 


12.1 








93352_CD45RO CD4 lymphocyte_anti- 






Brain (hippocannpus) 


7.9 


CD28/anti-CD3 


17.1 








93251_CD8 Lymphocytes_anti-CD28/anti- 






Brain (substantia nigra) 


9.5 


CD3 


9.1 








93353_chronic CD8 Lymphocytes 






Brain (thalannus) 


7.9 


2ry_resting dy 4-6 in IL-2 


13.4 








93574_chronic CDS Lymphocytes 






Brain (hypothalamus) 


23.0 


2ry_activated CD3/CD28 


9.2 




Spinal cord 


9.5 


93354 CD4 none 


7.6 








93252_Secondary Th1/Th2/Tr1_anti-CD95 






CNS ca. (glio/astro)U87-MG 


12.6 


CH11 


20.2 




CNSca. (glio/astro)U-118- 










MG 


11.6 


93103_LAK cells_resting 


57.0 




CNSca. (astro)SW1783 


4.3 


93788 LAK cells IL-2 


18.8 




CNS ca.* (neuro; met )SK-N- 










AS 


10.4 


93787 LAK cells IL-2+IL-12 


14.2 




CNS ca, (astro) SF-539 


11.6 


93789_LAK cellsJL-2+IFN gamma 


20.9 




CNS ca. (astro) SNB-75 


4.4 


93790_LAK cellsJL-2+ IL-18 


14.8 




CNS ca. (glio)SNB-19 


31.6 


93104_LAK cells_PMA/ionomycin and IL-18 


12.9 




CNS ca. (glio)U251 


17.3 


93578_NK Cells IL-2_resting 


17.4 








93109_Mixed Lymphocyte Reaction_Two 




r.^ 


CNS ca. (glio)SF-295 


20.9 


Way MLR 


43.5 








93110_Mixed Lymphocyte Reaction_Two 






Heart 


14.3 


Way MLR 


19.3 








93111_Mixed Lymphocyte Reaction_Two 






Skeletal muscle 


11.7 


Way MLR 


12.6 




Bone marrow 


21.9 


93112_Mononuclear Cells (PBMCs)„resting 


8.7 




Thymus 


20.9 


93113_Mononuclear Cells (PBMCs)_PWM 


28.5 


Q 


Spleen 


23.8 


93114_Mononuclear Cells (PBMCs)_PHA-L 


26.2 




Lymph node 


24.2 


93249_Ramos (B cell)_none 


0.3 


f!l 


Colon (ascending) 


17.2 


93250_Ramos (B cell)Jonomycin 


1.2 




Stomach 


11.1 


93349_B lymphocytes_PWM 


25.7 




Small intestine 


21.5 


93350_B lymphoytes_CD40L and IL-4 


13.0 








92665_EOL-1 (Eosinophil)_dbcAMP 






Colon ca.SW480 


12.2 


differentiated 


26.4 




Colon ca.* (SW480 




93248_EOL-1 






met)SW620 


8.6 


(Eosinophil)_dbcAMP/PMAionomycin 


11.4 




Colon ca.HT29 


16.2 


93356_Dendntic Cells_none 


40.3 




Colon ca.HCT-116 


8.1 


93355_Dendritic Cells_LPS 100 ng/ml 


33.0 




Colon ca.CaCo-2 


22.1 


93775 Dendritic Cells anti-CD40 


20.5 




Colon ca.HCT-15 


18.6 


93774_Monocytes_resting 


23.3 




Colon ca.HCC-2998 


21.9 


93776_Monocytes_LPS 50 ng/ml 


6.9 




Gastric ca.* (liver met) NCI- 










N87 


42.9 


93581_Macrophages_resting 


14.7 




Bladder 


95.3 


93582_Macrophages_LPS 100 ng/ml 


64.6 




Trachea 


18.3 


93098_HUVEC (Endothelial)_none 


6.8 




Kidney 


25.7 


93099_HUVEC (Endothelial)_starved 


13.9 




Kidney (fetal) 


15.8 


93100_HUVEC (Endothelial)JL-lb 


7.5 




Renal ca. 786-0 


16.5 


93779_HUVEC (Endothelial)_IFN gamma 


27.7 








93102_HUVEC (Endothelial)_TNF alpha + 






Renal ca.A498 


16.5 


IFN gamma 


11.8 








93101 HUVEC (Endothelial) TNF alpha + 






Renal ca.RXF 393 


7.4 


IL4 


6.7 



15966-697 



Renal ca.ACHN 


11.9 


93781_HUVEC (Endoth^^^lL-11 


A f\ A 

10.4 


Renal ca.UO-31 


15.8 


93583_Lung Microvascu^Pmdothelial 

Cells none 


8.8 


Renal ca.TK-IO 


28.7 


93584 Lung Microvascular Endothelial 
Cells_TNFa (4 ng/ml) and IL1b (1 
ng/ml) 


o.b 


Liver 


100.0 


92662_Microvascular Dermal 
endothelium none 


22.1 


Liver (fetal) 


81.8 


92663_Microsvasular Dermal 

endothelium_TNFa (4 ng/ml) and ILIb 

/A - - t i\ 

(1 ng/ml) 


18.7 


Liver ca. (hepatoblast) HepG2 


28.3 


93773 Bronchial epithelium TNFa (4 
ng/ml) and IL1b(1 ng/ml) ** 


35.4 


Lung 


10.7 


93347_Small Airway Epithelium_none 


-inn 
1U.9 


Lung (fetal) 


10.9 


93348_Small Airway EpitneIium_TNFa (4 
ng/ml) and ILIb (1 ng/ml) 


50.0 


■ / II 1 IV I V/ Jt 

Lung ca. (small cell) LX-1 


24.3 


92668_Coronery Artery SMC_resting 


27.9 


Lung ca. (small cell) NCI-H69 


41.5 


92669_Coronery Artery SMC_TNFa (4 
ng/ml) and IL1b (1 ng/ml) 


25.4 


I / II \ 1 1 

Lung ca. (s.cell van) SHP-77 


4.6 


931 07_astrocytes_resting 


"7 >l 

7.4 


Lung ca. (large cell)NCI-H460 


46.3 


93108_astrocytes_TNFa (4 ng/ml) and IL1b 
(1 ng/ml) 


10.7 


Lung ca. (non-sm. cell) A549 


45.4 


92666_KU-812 (Dasopnil)_resting 


3.2 


Lung ca. (non-s.cell) NCI-H23 


54.3 


92667_KU-812 (Basophil)_PMA/ionoycin 


6.7 


Lung ca (non-s.cell) HOP-62 


50.7 


93579_CCD1 106 (Keratinocytes)_none 


12.2 


Lung ca. (non-s.cl) NCI-H522 


38.4 


93580_CCD1 106 (Keratinocytes)_TNFa 
and IFNg 


10U.0 


Lung ca. (squam.) SW 900 


30.8 


93791 Liver Cirrhosis 


2/.b 


Lung ca. (squam.) NCI-H596 


15.5 


93792_Lupus Kidney 


32.3 


Mammary gland 


65.5 


93577 NCI-H292 




Breast ca.* (pi. effusion) 
MCF-7 


4.4 


93358 NCI-H292 IL-4 


70.2 


Breast ca.* (pl.ef) MDA-MB- 
231 


3.5 


93360 NCI-H292 IL-9 


54.3 


Breast ca. (pl. effusion)T47D 


8.7 


93359 NCI-H292 IL-lo 


A~7 n 
4/.U 


Breast ca. BT-549 


5.7 


93357_NCI-H292_IFN gamma 




Breast ca.MDA-N 


16.6 


93777 HPAEC - 


23.8 


Ovary 


20.5 


93778_HPAEC_IL-1 beta/TNA alpha 


21.5 


Ovarian ca. OVCAR-3 


21.6 


93254_Normal Human Lung 
Fibroblast none 


49.3 


Ovarian ca.OVCAR-4 


8.3 


93253_Normal Human Lung 

Fibroblast_TNFa (4 ng/ml) and IL-lb 
(1 ng/ml) 


40. o 


Ovarian ca.OVCAR-5 


26.1 


93257_Normal Human Lung FibroblastJL-4 


48.3 


Ovanan ca.OVCAR-8 


48.0 


93256_Normal Human Lung Fibroblast_IL-9 


29.3 


Ovarian ca.lGROV-l 


9.3 


93255_Normal Human Lung FibroblastJL- 
13 




Ovanan ca.* (ascites)SK-0V-3 


8.8 


93258_Normal Human Lung Fibroblast J FN 
gamma 


DO. 9 


Uterus 


13.4 


93106_Dermal Fibroblasts 
CCD1070_resting 


OA o 

20.2 


Plancenta 


9.4 


93361_Dermal Fibroblasts CCD1070_TNF 
alpha 4 ng/ml 


35.1 


Prostate 


21.3 


93105_Dermal Fibroblasts CCD1070_IL-1 
beta 1 ng/ml 


lO.U 


Prostate ca. (bone met)PC-3 


17.7 


93772_dermal fiDroDlast_lrN gamma 


O't Q 


Testis 


11.7 


93771 dermal nbroDlast lL-4 


0*1 o 


Melanoma nsbooiA), i 


y.u 


SO^O? IDU UOIIIIS 1 


ft ft 
o.o 


Melanoma* (met) Hs688(B).T 


12.9 


93260 IBD Colitis 2 


3.5 


Melanoma UACC-62 


12.4 


93261 IBDCrohns 


1.3 
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Melanoma Ml 4 


9.5 


73501 0_Colon_normal 


on o 


Melanoma lua iivivi 


Q i 
O. 1 


/ uDun y_Lung_none ^^^f 




Melanoma* (met) SK-MEl^ 


8.8 


64028-1 _Thymus_none 


33.5 


Melanoma SK-MEL-28 


8.0 


64030-1„Kidney_none 


21.0 



Taqman results in Table 14 show high expression of clone FCTR5 in bladder, liver and 
adrenal gland suggesting a possible role in the treatment of diseases involving these tissues. 



Table 15: Primer Design for Probe Agl541 (FCTR6) 



Primer 


Sequences 


TM 


Length 


Start Pos. 


SEQ ID 
NO 


Forward 


5'-AGAAGAACACCCCAGGGATATA-3' 


58.8 


22 


1076 


35 


Probe 


FAM-5'-CCTCGTTGGTGAACTACAACCTCTGG-3'-TAMRA 


67.9 


26 


1100 


36 


Reverse 


5'-CCTCTAGCTGGGTCACTTTCTC-3* 


59.5 


22 


1129 


37 



TABLE 16: TAQMAN RESULTS FOR FCTR6 (PANEL ID) 



Tissue_Name 


Panel 1D 
Run 1 Run 2 


Liver adenocarcinoma 


0.0 


0.0 


Heart (fetal) 


0.0 


0.0 


Pancreas 


0.0 


0.0 


Pancreatic ca.CAPAN 2 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Thyroid 


0.0 


0.0 


Salivary gland 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


Brain (fetal) 


0.5 


0.4 


Brain (whole) 


1.1 


1.7 


Brain (amygdala) 


0.0 


1.8 


Brain (cerebellum) 


0.6 


1.9 


Brain (hippocampus) 


3.3 


3.4 


Brain (thalamus) 


1.0 


1.2 


Cerebral Cortex 


1.6 


2.6 


Spinal cord 


2.5 


0.4 


CNS ca. (glio/astro)U87-MG 


0.0 


0.0 


CNS ca. (glio/astro)U-1 1 8-MG 


0.0 


0.0 


CNSca. (astro)SW1783 


0.0 


0.0 


CNS ca.* (neuro; met )SK-N-AS 


0.0 


0.0 


CNS ca. (astro)SF-539 


0.0 


0.0 


CNS ca. (astro) SNB-75 


0.7 


0.0 


CNSca. (glio)SNB-19 


0.0 


0.0 


CNS ca. (glio)U251 


0.0 


0.0 


CNS ca. (glio)SF-295 


0.0 


0.8 


Heart 


0.0 


0.0 


Skeletal muscle 


0.0 


0.0 
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Bone marrow 




A A 

0.0 


Thymus 




A A 

0.0 


Spleen 


U.O 


A A 

0.0 


Lymph node 


0.0 


A A 
0.0 


Colorectal 


U.U 


A e 
O.D 


Stomach 


i.y 


A A 
0.0 


Small intestine 


0.0 


'I A 
1.0 


Colon ca. oW4oO 


U.U 


A A 

0.0 


Colon ca. (SW4oO met)SWo20 


0.0 


A A 

0.0 


Colon ca. HT29 


0.0 


A A 

0.0 


Colon ca. HCT-llo 


O.D 


A A 

0.4 


Colon ca.CaCo-2 


l.O 


A A 
0.0 


83219 CC Well to Mod Dlff (ODOoood) 


0,0 


A A 
0.0 


Colon ca.HCC-299o 


0.0 


A A 
0.0 


Gastric ca. (liver met) NCI-No7 


1.2 


A A 
0.0 


Dladaer 


0.0 


A A 
0.0 


Trachea 


0.0 


A A 
0.4 


Kidney 


f\ Q 

O.o 


1 .2 


Kidney (fetal) 


0.5 


A 7 

0.7 


Renal ca. 786-0 


0.0 


A A 
0.0 


Renal ca.A498 


0.0 


A A 
0.0 


Renal ca.RXF 393 


0.0 


A A 

0.0 


Renal ca.ACHN 


0.0 


A A 

0.0 


Renal ca. UO-31 


0.0 


A A 

0.0 


Renal ca.TK-10 


0.0 


A A 

0.0 


Liver 


A A 

0.0 


A A 

0.0 


Liver (fetal) 


0.2 


A A 

0.0 


Liver ca. (hepatoblast) HepG2 


A A 

0.0 


A A 

0.0 


Lung 


A A 
0.0 


A A 
0.0 


Lung (fetal) 


A A 

0.0 


A A 

0.0 


Lung ca. (small cell) LX-1 


1.7 


2.3 


Lung ca. (small cell)NCI-H69 


0.0 


n A 

0.0 


Lung ca. (s.cell var.) SHP-77 


1.3 


A C 

2.5 


Lung ca. (large cell)NCI-H460 


A r\ 

0.0 


A A 

0.0 


Lung ca. (non-sm. cell) A549 


0.0 


0.0 


Lung ca. (non-s.cell) NCI-H23 


1.2 


0.4 


Lung ca (non-s.cell) HOP-62 


0.0 


A A 

0.0 


Lung ca. (non-s.cl) NCI-rl522 


A A 

0.0 


A A 

0.0 


Lung ca. (squam.) SW 900 


A A 

0.0 


A "7 
0.7 


Lung ca. (squam.) NCI-H596 


n A 

0.0 


A O 

1.3 


Mammary gland 


A A 

0.0 


A C 

1.5 


Breast ca. (pl. effusion) MCF-7 


A A 

0.0 


A A 

op 


Breast ca. (pi.ef) MDA-Md-231 


O.O 


A C 

0.5 


Breast ca. (pl. effusion) T47D 


A A 

1.2 


A O 

0.3 


Breast ca. BT-549 


0.5 


A A 

0.0 


Breast ca. MDA-N 


A A 

0.0 


A A 
0.0 


Ovary 


A A 
0.0 


A A 
U.O 


Ovanan ca. OVCAR-3 


A A 
0.0 


A A 
0.0 


Ovanan ca.OVCAK-4 


A A 
0.0 


A A 
U.U 


Ovanan ca.OVCAK-5 


O.D 


A 7 
U. 1 


Ovanan ca.OVCAK-o 


A A 
0.0 


A A 
U.U 


Ovarian ca.lGROV-1 


A A 
0.0 


A A 
0.0 


Ovanan ca. (ascites) SK-OV-3 


A A 
0.0 


A A 

0.0 


Uterus 


A A 

0.0 


A A 

0.0 


Plancenta 


A A 
0.0 


A A 
0.0 


Prostate 


A A 

0.0 


A 1 
0. t 


prostate ca. (Done met;PU-o 


U.U 


U.U 


Testis 


100.0 


100.0 


Melanoma Hs688(A).T 


0.0 


0.0 
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Melanoma* (met) Hs68aj|^T 




0.0 


Melanoma UACC-62 




0.0 


Melanoma M14 


0.0 


0.0 


Melanoma loa imvi 


U.U 


U.U 


Melanoma* (met)SK-MEL-5 


0.0 


0.0 


Adipose 


0.5 


0.0 



Table 17: Taqman Results for FCTR6 (Panel 2D) 



Tissue Name 


Panel 2D 
Run 1 Run 2 


Normal Colon GENPAK 061003 


5.4 


2.4 


83219 CC Well to Mod Diff (OD03866) 


7.3 


0.0 


83220 CC NAT (OD03866) 


5.8 


1.5 


83221 CC Gr.2 rectosigmoid (OD03868) 


3.4 


0.0 


83222 CC NAT (OD03868) 


0.0 


0.0 


83235 CC Mod Diff (ODO3920) 


11.0 


1.4 


83236 CC NAT (ODO3920) 


0.0 


0.0 


83237 CC Gr.2 ascend colon (OD03921) 


6.2 


2.5 


83238 CC NAT (OD03921) 


10.2 


0.0 


83241 CC from Partial Hepatectomy (ODO4309) 


3.6 


0.0 


83242 Liver NAT (ODO4309) 


0-0 


2.4 


87472 Colon mets to lung (OD04451-01) 


7.2 


4.4 


87473 Lung NAT (OD04451-02) 


0.0 


0.0 


Normal Prostate Clontech A+ 6546-1 


4.8 


2.9 


84140 Prostate Cancer (OD04410) 


3.5 


0.0 


84141 Prostate NAT(OD04410) 


3.4 


0.0 


87073 Prostate Cancer {OD04720-01) 


9.0 


8.5 


87074 Prostate NAT (OD04720-02) 


0.0 


0.0 


Normal Lung GENPAK 061010 


17.7 


6.5 


83239 Lung Met to Muscle (OD04286) 


0.0 


2.3 


83240 Muscle NAT (OD04286) 


0.0 


0.0 


84136 Lung Malignant Cancer (OD03126) 


6.5 


5.7 


84137 Lung NAT (OD03126) 


0.0 


0.0 


84871 Lung Cancer (OD04404) 


0.0 


0.0 


84872 Lung NAT (OD04404) 


0.0 


0.0 


84875 Lung Cancer (OD04565) 


0.0 


0.0 


85950 Lung Cancer (OD04237-01) 


0.0 


0.0 


85970 Lung NAT (OD04237-02) 


0.0 


0.0 


83255 Ocular Mel Met to Liver (ODO4310) 


A 1 

H.O 


U.U 


83256 Liver NAT (ODO4310) 


0.0 


0.0 


84139 Melanoma Mets to Lung (OD04321) 


0.0 


0.0 


84138 Lung NAT(OD04321) 


0.0 


0.0 


Normal Kidney GENPAK 061008 


28.1 


39.2 


83786 Kidney Ca, Nuclear grade 2 {OD04338) 


0.0 


3.0 


83787 Kidney NAT (OD04338) 


22.7 


31.6 


83788 Kidney Ca Nuclear grade 1/2 (OD04339) 


0.0 


3.1 


83789 Kidney NAT (OD04339) 


97.3 


100.0 


83790 Kidney Ca. Clear cell type (OD04340) 


0.0 


0.0 


83791 Kidney NAT (OD04340) 


100.0 


34.4 


83792 Kidney Ca. Nuclear grade 3 (OD04348) 


2.0 


4.9 


83793 Kidney NAT (OD04348) 


30.2 


19.9 


87474 Kidney Cancer (OD04622-01) 


0.0 


2.4 


87475 Kidney NAT (OD04622-03) 


8.4 


7.2 


85973 Kidney Cancer (OD04450-01) 


0.0 


0.0 


85974 Kidney NAT (OD04450-03) 


47.3 


12.9 
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Kidney Cancer Clonted||^0607 


-IP— 


0-0 


Kidney NAT Clontech ^pbos 




0.0 


Kidney Cancer Clontech b120613 


0.0 


0.0 


Kidney NAT Clontech 8120614 


20.6 


22.9 


Kidney Cancer Clontech 9010320 


0-0 


0.0 


Kidney NAT Clontech 9010321 


3,4 


26.4 


Normal Uterus GENPAK 061018 


0.0 


0.0 


Uterus Cancer GENPAK 06401 1 


14.9 


0.0 


Normal Thyroid Clontech A+ 6570-1 


0.0 


0.0 


Thyroid Cancer GENPAK 064010 


0.0 


0.0 


Thyroid Cancer INVITROGEN A302152 


0.0 


0.0 


Thyroid NAT INVITROGEN A302153 


0.0 


0.0 


Normal Breast GENPAK 061019 


5.2 


3.5 


84877 Breast Cancer (OD04566) 


0.0 


0-0 


85975 Breast Cancer (OD04590-01) 


0.0 


0.0 


85976 Breast Cancer Mets (OD04590-03) 


0.0 


0.0 


87070 Breast Cancer Metastasis (OD04655-05) 


0.0 


0-0 


GENPAK Breast Cancer 064006 


0.0 


2.5 


Breast Cancer Clontech 91 00266 


6.2 


0-0 


Breast NAT Clontech 9100265 


0.0 


0.0 


Breast Cancer INVITROGEN A209073 


1.5 


2.5 


Breast NAT INVITROGEN A2090734 


24.3 


26.2 


Normal Liver GENPAK 061009 


10.5 


2.7 


Liver Cancer GENPAK 064003 


5-9 


1-7 


Liver Cancer Research Genetics RNA 1025 


21.6 


11.0 


Liver Cancer Research Genetics RNA 1026 


0.0 


0.0 


Paired Liver Cancer Tissue Research Genetics RNA 6004-T 


3.3 


13.5 


Paired Liver Tissue Research Genetics RNA 6004-N 


3.2 


1.4 


Paired Liver Cancer Tissue Research Genetics RNA 6005-T 


0.0 


0.0 


Paired Liver Tissue Research Genetics RNA 6005-N 


0.0 


0.0 


Normal Bladder GENPAK 061001 


0.0 


0.0 


Bladder Cancer Research Genetics RNA 1023 


0.0 


0.0 


Bladder Cancer INVITROGEN A302173 


4.6 


2.3 


87071 Bladder Cancer (OD0471 8-01) 


17.9 


11.4 


87072 Bladder Normal Adjacent (OD04718-03) 


0.0 


0.0 


Normal Ovary Res. Gen. 


0.0 


0.0 


Ovarian Cancer GENPAK 064008 


1.7 


4.8 


87492 Ovary Cancer (OD04768-07) 


0.0 


2-1 


87493 Ovary NAT (OD04768-08) 


0.0 


0-0 


Normal Stomach GENPAK 061017 


3.3 


2.9 


NAT Stomach Clontech 9060359 


0.0 


0.0 


Gastric Cancer Clontech 9060395 


0-0 


0.0 


NAT Stomach Clontech 9060394 


0.0 


0.0 


Gastric Cancer Clontech 9060397 


0.0 


0.0 


NAT Stomach Clontech 9060396 


0.0 


0.0 


Gastric Cancer GENPAK 064005 


6.3 


3.8 



Table 18: Taqman Results for clone 27455183.0.19 (Panel 4D) 



Tissue_Name 


Panel 4D 
Run 1 Run 2 


93768_Seconclary Th1_anti-CD28/anti-CD3 


0.0 


0.0 


93769_Secondaty Th2_anti-CD28/anti-CD3 


0.0 


0.0 


93770_Secondaty Tr1_anti-CD28/anti-CD3 


13.5 


17.1 


93573_Secondary Th1_resting day 4-6 in IL-2 


0.0 


0.0 


93572_Secondary Th2_resting day 4-6 in IL-2 


0.0 


0.0 
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93571_Secondary Trl^^ng day 4-6 in IL-2 


0.0 


0.0 


93568_primary Th1_aiWB)28/anti-CD3 


0.0 


0.0 


93569 primary Th2_antiT;D28/anti-CD3 


0.0 


0,0 


93570 primary Tr1_anti-CD28/anti-CD3 


0.0 


0.0 


93565 jDrimary Th1_resting dy 4-6 in IL-2 


0.0 


0.0 


93566_primary Th2_resting dy 4-6 in IL-2 


0.0 


0.0 


93567_primary Tr1_resting dy 4-6 in IL-2 


0.0 


0.0 


93351 _CD45RA CD4 lymphocyte_anti-CD28/anti-CD3 


0.0 


0.0 


93352_CD45RO CD4 lymphocyte_anti-CD28/anti-CD3 


0.0 


0,0 


93251_CD8 Lymphocytes_anti-CD28/anti-CD3 


0.0 


0.0 


93353_chronic CD8 Lymphocytes 2ry_resting dy 4-6 In IL-2 


0.0 


0.0 


93574_chronic CD8 Lymphocytes 2ry_activated CD3/CD28 


0.0 


0.0 


93354 CD4 none 


5.8 


0.0 


93252_Secondary Th1/Th2/Tr1_anti-CD95 CH11 


0.0 


0.0 


93103_LAK cells_resting 


0.0 


0.0 


93788_LAK cellsJL-2 


0.0 


0.0 


93787_LAK cellsJL-2+IL-12 


0.0 


0.0 


93789_LAK cellsJL-2+IFN gamma 


0.0 


0,0 


93790_LAK cellsJL-2+ IL-18 


0.0 


0,0 


93104_LAK cells_PMA/ionomycin and IL-18 


0,0 


0.0 


93578_NK Cells IL-2_resting 


0.0 


0.0 


93109_Mixed Lymphocyte Reaction_Two Way MLR 


0.0 


0.0 


93110_Mixed Lymphocyte Reaction_Two Way MLR 


0.0 


0.0 


93111_Mixed Lymphocyte Reaction_Two Way MLR 


0,0 


0.0 


93112_Mononuclear Cells (PBMCs)_resting 


0,0 


0.0 


93113_Mononuclear Cells (PBMCs)_PWM 


0.0 


0.0 


93114_Mononuclear Cells (PBMCsLPHA-L 


0.0 


0.0 


93249_Ramos (B cell)_none 


0.0 


38.2 


93250_Ramos (B cell)_ionomycin 


0.0 


0.0 


93349^8 lymphocytes_PWM 


0.0 


68.8 


93350_B lymphoytes„CD40L and IL-4 


31.0 


0.0 


92665_EOL-1 (Eosinophil)_dbcAMP differentiated 


0.0 


0.0 


93248_EOL-1 (Eosinophil)_dbcAMP/PMAionomycin 


0.0 


0.0 


93356 Dendritic Cells none 


0.0 


0.0 


93355_Dendritic Cells_LPS 100 ng/ml 


0.0 


0.0 


93775 Dendritic Cells anti-CD40 


32.5 


0.0 


93774_Monocytes_resting 


0.0 


0.0 


93776_Monocytes_LPS 50 ng/ml 


0.0 


0.0 


93581 _Macrophages_resting 


0.0 


0.0 


93582_Macrophages_LPS 100 ng/ml 


0.0 


0.0 


93098_HUVEC (EndothelialLnone 


0,0 


0.0 


93099_HUVEC (EndothelialLstarved 


11.3 


0.0 


93100_HUVEC (Endothelial)JL-1b 


0.0 


14.6 


93779_HUVEC (Endothelial)JFN gamma 


0.0 


0.0 


93102_HUVEC (Endothelial)_TNF alpha + IFN gamma 


0.0 


0.0 


93101 HUVEC (Endothelial) TNF alpha + IL4 


0.0 


0.0 


93781_HUVEC (Endothelial)JL-ll 


0.0 


0.0 


93583_Lung Microvascular Endothelial Cells_none 


0.0 


0.0 


93584 Lung Microvascular Endothelial Cells TNFa (4 ng/ml) and ILIb 
(1 ng/ml) 


0.0 


0.0 


92662 Microvascular Dermal endothelium none 


0.0 


0.0 


92663_Microsvasular Dermal endothelium_TNFa (4 ng/ml) and ILIb (1 
ng/ml) 


0.0 


0.0 


93773_Bronchial epithelium_TNFa (4 ng/ml) and ILIb (1 ng/ml) ** 


0.0 


0.0 


93347_Small Airway Epithelium_none 


0,0 


0,0 


93348_Small Airway Epithelium_TNFa (4 ng/ml) and ILIb (1 ng/ml) 


0.0 


0,0 


92668 Coronery Artery SMC_resting 


0,0 


0.0 


92669„Coronery Artery SMC_TNFa (4 ng/ml) and IL1b (1 ng/ml) 


0.0 


0.0 


931 07_astrocytes„resting 


0.0 


0.0 
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93108 astrocytes TN|A|ng/ml) and IL1b (1 ng/ml) 


0.0 


0.0 


92666 KU-812 (Baso|BB-esting 


0.0 


40.3 


92667_KU-812 (BasophilLPMA/ionoycin 


0.0 


0.0 


93579_CCD1 106 (Keratinocytes)_none 


0.0 


0.0 


93580_CCD1106 {Keratinocytes)_TNFa and IFNg ** 


0.0 


0.0 


93791 Liver Cirrhosis 


100.0 


99.3 


93792 Lupus Kidney 


0.0 


0.0 


93577 NCI-H292 


0.0 


0.0 


93358 NCI-H292 IL-4 


0.0 


0.0 


93360 NCI-H292 IL-9 


10.6 


0.0 


93359 NCI-H292 IL-13 


0.0 


65.5 


93357_NCI-H292JFN gamma 


0.0 


24.8 


93777 HPAEC - 


0.0 


0.0 


93778_HPAEC_IL-1 beta/TNA alpha 


0.0 


0.0 


93254_Normal Human Lung Fibroblast_none 


0.0 


0.0 


93253_Normal Human Lung Fibrob!ast_TNFa (4 ng/ml) and IL-1 b (1 
ng/ml) 


0.0 


0.0 


93257_Normal Human Lung Fibroblast_IL-4 


0.0 


0.0 


93256_Normal Human Lung FibroblastJL-9 


0.0 


0.0 


93255_Normal Human Lung Fibroblast_IL-13 


0.0 


0.0 


93258_Normal Human Lung FibroblastJFN gamma 


0.0 


0.0 


93106_Dermal Fibroblasts CCD1 070_resting 


0.0 


0.0 


93361 Dermal Fibroblasts CCD1070_TNF alpha 4 ng/ml 


0.0 


43.8 


93105_Dermal Fibroblasts CCD1070_IL-1 beta 1 ng/ml 


0.0 


0.0 


93772_dermal fibroblastJFN gamma 


42.0 


27.7 


93771 dermal fibroblast IL-4 


10.7 


90.1 


93259 IBD Colitis 1** 


0.0 


0.0 


93260 IBD Colitis 2 


13.8 


0.0 


93261 IBDCrohns 


0.0 


46.7 


735010 Colon normal 


15.6 


0.0 


73501 9_Lung_none 


12.9 


16.8 


64028-1_Thymus_none 


69.3 


100.0 


64030-1 _Kid ney_non e 


0.0 


0.0 



Taqman results in Table 18 demonstrate that clone FCTR6 is differentially expressed in 
clear cell Renal ceil carcinoma tissues versus the normal adjacent kidney tissues and thus could 
have a potential role in the treatment of renal cell carcinoma. 



EQUIVALENTS 

Although particular embodiments have been disclosed herein in detail, this has been done 
by way of example for purposes of illustration only, and is not intended to be limiting with 
respect to the scope of the appended claims which follow. In particular, it is contemplated by the 
inventors that various substitutions, alterations, and modifications may be made to the invention 
without departing from the spirit and scope of the invention as defined by the claims. The choice 
of nucleic acid starting material, clone of interest, or library type is believed to be a matter of 
routine for a person of ordinary skill in the art with knowledge of the embodiments described 
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herein. Other aspects,^j^ntages, and modifications considere^Blse within the scope of the 
following claims. 
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